AI search engines fail accuracy test, study finds 60% error rate

Hallucinations and doubling down on wrong information have been an ongoing struggle for developers.

A research team claims it now has those numbers.

They tested each for accuracy and recorded how frequently the tools refused to answer.

The researchers randomly chose 200 news articles from 20 news publishers (10 each).

Collectively, AI search engines are inaccurate 60 percent of the time.

Furthermore, these wrong results were reinforced by the AI’s “confidence” in them.

While some examples were adversarial queries, many were just general questions.

Even when admitting it was wrong, ChatGPT would follow up that admission with more fabricated information.

The LLM is seemingly programmed to answer every user input at all costs.

However, it only achieved a 28-percent completely accurate rating and was completely inaccurate 57 percent of the time.

ChatGPT isn’t even the worst of the bunch.

Both versions of X’s Grok AI performed poorly, with Grok-3 Search being 94 percent inaccurate.

Talk about a con.

However, not everyone agrees.

TechRadar’s Lance Ulanoff said he might never use Google again after trying ChatGPT Search.

Hedescribesthe tool as fast, aware, and accurate, with a clean, ad-free interface.

Featured on TechSpot#