RT X Freeze: Most AI models hallucinate more than you'd think and make up stuff that doesn't exist Grok 4.20 just ranked #1 in Non-Hallucination Rate ...
RT X Freeze
Most AI models hallucinate more than you'd think and make up stuff that doesn't exist
Grok 4.20 just ranked #1 in Non-Hallucination Rate with a 78% score, beating Claude Opus 4.6 (max), Gemini 3.1, GPT-5.4 (xhigh), and every other model on the list
xAI is quietly winning the accuracy game… and it’s built to be truthful