Hallucination rates depend on the yardstick. Whether you measure via Vectara...

https://spark-wiki.win/index.php/Which_Benchmark_Should_You_Cite_for_Multi-Turn_Chat_Apps_with_Citations%3F

Hallucination rates depend on the yardstick. Whether you measure via Vectara HHEM or HalluHard—where models hit 30.2% failure—results vary wildly. In 2026, don’t ask if your AI is accurate; ask which benchmark fits your use case

Submitted on 2026-05-18 08:02:13