xAI — reported benchmarks

xAI · last checked Jul 3, 2026, 6:02 AM

Reported benchmarks · Grok 4.3

captured Jul 3, 2026, 6:02 AM

Benchmark	Score	Notes
GPQA Diamond	90.1%	Graduate-level science reasoning; from Artificial Analysis and multiple sources
Tau-Bench (τ²-Bench)	97.7%	Tool-use and agentic benchmark
GDPval-AA	1500 Elo	Agentic task performance; xAI-reported improvement of 321 points from Grok 4.20
Artificial Analysis Intelligence Index	38 index	High reasoning mode on v4.1; composite of 9 evaluations
SciCode	47.3%	Code generation and problem-solving

Vendor-reported via automated web search — not independently verified. See the cited matrix on /models.
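
For readers who want to put the GDPval-AA row in context, here is a minimal sketch of what a 321-point Elo gain would imply. It assumes the conventional logistic Elo formula with a 400-point scale; this page does not say how GDPval-AA ratings are computed, so the function name and the scale are illustrative assumptions, not the benchmark's documented method.

```python
def elo_expected_score(rating_gap: float, scale: float = 400.0) -> float:
    """Expected head-to-head score for the higher-rated side.

    Uses the conventional Elo logistic curve. The 400-point scale is an
    assumption; GDPval-AA's actual rating method is not stated on this page.
    """
    return 1.0 / (1.0 + 10.0 ** (-rating_gap / scale))


# Figures taken from the table above (vendor-reported, not verified).
reported_elo = 1500
reported_gain = 321

# Rating implied for Grok 4.20 if the reported gain is exact.
implied_baseline = reported_elo - reported_gain

# Expected score of Grok 4.3 against that baseline under the assumed formula.
win_share = elo_expected_score(reported_gain)

print(f"Implied Grok 4.20 rating: {implied_baseline}")
print(f"Expected score vs. baseline: {win_share:.3f}")
```

Under these assumptions the implied earlier rating is 1179, and the newer model would be expected to win roughly 86% of pairwise comparisons. Treat both numbers as arithmetic on vendor-reported figures, not as measured results.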

In the news · xAI

NVIDIA's chips. xAI's lease. Apollo's paper. And the wealth-channel fund holding $621 million of it: Apollo's ADS Dissected · Substack · Jul 5, 2026, 3:29 PM
SpaceX offers half-price Starlink in Tennessee as xAI faces a Clean Air Act lawsuit · Yahoo · Jul 5, 2026, 10:19 AM
ICYMI: xAI debuts Grok Voice Agent Builder for Enterprises · TestingCatalog AI News · Jul 4, 2026, 3:15 PM
Rehabilitation work of dams launched in Xai-Xai · aimnews.org · Jul 3, 2026, 8:24 AM
SpaceX's Cursor Bet Tests AI Model Neutrality Post-Acquisition · The Tech Buzz · Jul 2, 2026, 7:22 PM
Elon Musk's xAI Unveils No-Code Tool to Build AI Call Centers Capable of Cloning Human Voices · finance.biggo.com · Jul 2, 2026, 3:06 AM
SpaceX offers half-price Starlink in Memphis amid backlash over xAI data centre · 디지털투데이 · Jul 2, 2026, 1:57 AM
xAI has released 'Voice Agent Builder,' a tool that allows users to create an AI call center with a cloned human voice without coding · GIGAZINE · Jul 2, 2026, 1:46 AM

Importance-filtered press coverage (Google News) mentioning xAI. Headlines link to the original; verify before acting.

Change history

Vendor claim Jul 3, 2026, 6:02 AM

xAI reported benchmarks updated

Grok 4.3: 5 benchmark claims (via web search)

Vendor claim Jun 25, 2026, 6:53 PM

xAI reported benchmarks updated

Grok 4: 3 benchmark claims (via web search)