When Gemini 3 Flash doesn't know the answer, he just makes it up.

When Gemini 3 Flash doesn't know the answer, he just makes it up.

Gemini 3 Flash is a model of fast and smart artificial intelligence. But, according to an assessment made by an independent test group, if you ask them something that he doesn't really know is incomprehensible, difficult or out of his knowledge he will almost always try [...]

But, according to an assessment made by an independent test group, if you ask them something that he doesn't really know about, hard or out of his knowledge, he'll almost always try to respond by lying or inventing something.

In the tests of “the degree of hallucinations” (recognition rate) at the Benchmark entry AA @Omniscience, Gemini 3 Flash reached a 91 percent fush, which means that even when there was no correct answer, he answered anyway, and often it was completely invented.

This phenomenon of “is a known problem in text generation patterns: to know when to stop and say “does not know” is as important as knowing how to answer. According to this test, Gemini doesn't do this very well, reports Telegraph, broadcast Periscope.

This does not mean, however, that 91 percent of his answers are wrong. This figure just shows how often he invents something in situations when the real answer would be “doesn't know”.

Even though Gemini 3 Flash can be very powerful and perform well on general tests, he is very self - confident even when he should be careful that it may be a problem in serious use. /Periscope

Related
Britain to use artificial intelligence to verify the age of asylum seekers

Britain to use artificial intelligence to verify the age of asylum seekers

Good news from YouTube: Videos with artificial intelligence will be clearly labeled

Good news from YouTube: Videos with artificial intelligence will be clearly labeled

EU fines Chinese giant Temu at 200m euros for dangerous children's toys and damaged chargers

EU fines Chinese giant Temu at 200m euros for dangerous children's toys and damaged chargers

The Internet has been partially restored to Iran, says organisation overseer

The Internet has been partially restored to Iran, says organisation overseer

The Ferrari represents the first electric car, it costs $640,000.

The Ferrari represents the first electric car, it costs $640,000.

Stellantis presents ambitious plan for new models

Stellantis presents ambitious plan for new models

Why doesn't gold rust? Scientists detect “atomic reasoning” following the endurance of precious metal

Why doesn't gold rust? Scientists detect “atomic reasoning” following the endurance of precious metal

Musk loses battle for OpenAI control, court gives Altman justice

Musk loses battle for OpenAI control, court gives Altman justice

Mercedes - AMG discovered its first four-door electric vault

Mercedes - AMG discovered its first four-door electric vault

This Toyota model fails on security tests

This Toyota model fails on security tests

The pilot robot “mecha” appears on the market

The pilot robot “mecha” appears on the market

Bitcoin falls below $77,000

Bitcoin falls below $77,000

Instagram criticized for “Instances”

Instagram criticized for “Instances”