Don't trust generic "99% accurate" claims. Hallucination metrics vary wildly...
https://wool-wiki.win/index.php/Why_did_Grok-3_score_94%25_citation_errors_on_news_queries%3F
Don't trust generic "99% accurate" claims. Hallucination metrics vary wildly depending on the test. If you use Vectara's HHEM to measure grounding, you see one reality; apply AA-Omniscience for logic, and the picture shifts entirely