← 返回首页

RT X Freeze: Grok 4.20 Multi-Agent just ranked #1 on Search Arena (Style Control) Grok 4.20 set a new industry record on AA-Omniscience with 78% accur...

来源:马斯克X | 发布时间:2026-04-05 16:02
RT X Freeze
Grok 4.20 Multi-Agent just ranked #1 on Search Arena (Style Control)

Grok 4.20 set a new industry record on AA-Omniscience with 78% accuracy - the lowest hallucination rate ever recorded on the benchmark

Beats Claude Opus 4.6 & Gemini 3.1 Pro