← 返回首页

RT X Freeze: Most AI models hallucinate more than you'd think and make up stuff that doesn't exist Grok 4.20 just ranked #1 in Non-Hallucination Rate ...

来源:马斯克X | 发布时间:2026-03-24 17:56
RT X Freeze
Most AI models hallucinate more than you'd think and make up stuff that doesn't exist

Grok 4.20 just ranked #1 in Non-Hallucination Rate with a 78% score - beating Claude Opus 4.6(max), Gemini 3.1, GPT-5.4(xhigh), and every other model on the list

xAI is quietly winning the accuracy game… and it’s built to be truthful