Change feed
Every pricing change and breaking deprecation we've detected, newest first.
-
Tracking since Jun 25, 2026 · 4 snapshots on file
Model naming standardized: 'Flash-Lite' renamed to 'Flash Lite' across all tiers in batch enqueued token tables.
Full history & current snapshot →View raw diff +15 −15
- | Gemini 3.1 Flash-Lite | 10,000,000 | - | Gemini 3.1 Flash-Lite Preview | 10,000,000 | - | Gemini 2.5 Flash-Lite | 10,000,000 | - | Gemini 2.5 Flash-Lite Preview | 10,000,000 | - | Gemini 2.0 Flash-Lite | 10,000,000 | - | Gemini 3.1 Flash-Lite | 500,000,000 | - | Gemini 3.1 Flash-Lite Preview | 500,000,000 | - | Gemini 2.5 Flash-Lite | 500,000,000 | - | Gemini 2.5 Flash-Lite Preview | 500,000,000 | - | Gemini 2.0 Flash-Lite | 1,000,000,000 | - | Gemini 3.1 Flash-Lite | 1,000,000,000 | - | Gemini 3.1 Flash-Lite Preview | 1,000,000,000 | - | Gemini 2.5 Flash-Lite | 1,000,000,000 | - | Gemini 2.5 Flash-Lite Preview | 1,000,000,000 | - | Gemini 2.0 Flash-Lite | 5,000,000,000 | + | Gemini 3.1 Flash Lite | 10,000,000 | + | Gemini 3.1 Flash Lite Preview | 10,000,000 | + | Gemini 2.5 Flash Lite | 10,000,000 | + | Gemini 2.5 Flash Lite Preview | 10,000,000 | + | Gemini 2.0 Flash Lite | 10,000,000 | + | Gemini 3.1 Flash Lite | 500,000,000 | + | Gemini 3.1 Flash Lite Preview | 500,000,000 | + | Gemini 2.5 Flash Lite | 500,000,000 | + | Gemini 2.5 Flash Lite Preview | 500,000,000 | + | Gemini 2.0 Flash Lite | 1,000,000,000 | + | Gemini 3.1 Flash Lite | 1,000,000,000 | + | Gemini 3.1 Flash Lite Preview | 1,000,000,000 | + | Gemini 2.5 Flash Lite | 1,000,000,000 | + | Gemini 2.5 Flash Lite Preview | 1,000,000,000 | + | Gemini 2.0 Flash Lite | 5,000,000,000 |
-
xAI reported benchmarks updated
Tracking since Jun 25, 2026 · 2 snapshots on file
Grok 4.3: 5 benchmark claims (via web search)
Full history & current snapshot → -
Tracking since Jun 25, 2026 · 3 snapshots on file
Arena Elo (text, overall) updated.
Full history & current snapshot →View raw diff +21 −21
- claude-fable-5 1494 - claude-opus-4-6-thinking 1500 - claude-opus-4-8 1456 - deepseek-v4-pro 1449 - deepseek-v4-pro-thinking 1446 - gemini-3.5-flash 1480 - gemma-4-26b-a4b 1435 - glm-5 1446 - glm-5.1 1468 - glm-5.2 (max) 1465 - gpt-5.2 1411 - gpt-5.4 1454 - gpt-5.5-high 1468 - grok-4.1-thinking 1437 - grok-4.20-beta-0309-reasoning 1455 - mimo-v2-omni 1423 - mimo-v2.5 1427 - minimax-m3 1440 - muse-spark 1472 - nvidia-nemotron-3-ultra-550b-a55b-nvfp4 1439 - qwen3.7-plus 1463 + claude-fable-5 1495 + claude-opus-4-6-thinking 1501 + claude-opus-4-8 1454 + claude-sonnet-5-thinking 1445 + deepseek-v4-pro 1450 + deepseek-v4-pro-thinking 1447 + gemini-3.5-flash 1482 + gemma-4-26b-a4b 1434 + glm-5 1445 + glm-5.1 1467 + glm-5.2 (max) 1463 + gpt-5.4 1453 + gpt-5.5-high 1469 + grok-4.1-thinking 1438 + grok-4.20-beta-0309-reasoning 1454 + mimo-v2-omni 1422 + mimo-v2.5 1426 + minimax-m3 1439 + muse-spark 1473 + nvidia-nemotron-3-ultra-550b-a55b-nvfp4 1438 + qwen3.7-plus 1462
-
NVIDIA Nemotron models changed
Tracking since Jun 25, 2026 · 6 snapshots on file
Added nvidia/MiniMax-M2.7-DFlash model with other license type, available until 2026-06-16.
- Model: nvidia/MiniMax-M2.7-DFlash
- License: other
- Availability date: 2026-06-16
Full history & current snapshot →View raw diff +1 −0
+ nvidia/MiniMax-M2.7-DFlash license:other 2026-06-16 -
Meta reported benchmarks updated
Tracking since Jun 25, 2026 · 2 snapshots on file
Llama 4 Maverick: 8 benchmark claims (via web search)
Full history & current snapshot → -
deepseek-coder-1.3b-base: Epoch Capabilities Index (ECI) ↑ 63.596 → 63.813
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Cerebras-GPT-13B: Epoch Capabilities Index (ECI) ↑ 82.645 → 82.79
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
starcoder2-3b: Epoch Capabilities Index (ECI) ↑ 88.205 → 88.329
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
deepseek-coder-6.7b-base: Epoch Capabilities Index (ECI) ↑ 89.062 → 89.184
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
dolly-v2-12b: Epoch Capabilities Index (ECI) ↑ 89.113 → 89.235
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Baichuan-7B: Epoch Capabilities Index (ECI) ↑ 89.891 → 90.006
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
phi-1_5: Epoch Capabilities Index (ECI) ↑ 91.015 → 91.132
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
xgen-7b-8k-base: Epoch Capabilities Index (ECI) ↑ 92.881 → 92.989
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
starcoder2-7b: Epoch Capabilities Index (ECI) ↑ 93.025 → 93.132
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemma-2b: Epoch Capabilities Index (ECI) ↑ 93.684 → 93.789
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
mpt-7b: Epoch Capabilities Index (ECI) ↑ 94.078 → 94.181
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
falcon-7b: Epoch Capabilities Index (ECI) ↑ 94.588 → 94.689
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
deepseek-coder-33b-base: Epoch Capabilities Index (ECI) ↑ 95.816 → 95.912
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Baichuan-2-7B-Base: Epoch Capabilities Index (ECI) ↑ 95.829 → 95.924
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
LLaMA-7B: Epoch Capabilities Index (ECI) ↑ 96.2 → 96.296
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-2-7b: Epoch Capabilities Index (ECI) ↑ 98.591 → 98.678
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
LLaMA-13B: Epoch Capabilities Index (ECI) ↑ 100.124 → 100.207
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
mpt-30b: Epoch Capabilities Index (ECI) ↑ 100.202 → 100.284
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
mpt-30b-instruct: Epoch Capabilities Index (ECI) ↑ 100.202 → 100.284
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
INTELLECT-1-Instruct: Epoch Capabilities Index (ECI) ↑ 100.403 → 100.484
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen2.5-Coder-1.5B: Epoch Capabilities Index (ECI) ↑ 102.557 → 102.629
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Baichuan-2-13B-Base: Epoch Capabilities Index (ECI) ↑ 102.769 → 102.841
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
falcon-40b-instruct: Epoch Capabilities Index (ECI) ↑ 104.042 → 104.11
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
falcon-40b: Epoch Capabilities Index (ECI) ↑ 104.042 → 104.11
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Yi-6B-Chat: Epoch Capabilities Index (ECI) ↑ 104.36 → 104.427
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Yi-6B: Epoch Capabilities Index (ECI) ↑ 104.36 → 104.427
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
starcoder2-15b: Epoch Capabilities Index (ECI) ↑ 104.587 → 104.653
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-2-34b: Epoch Capabilities Index (ECI) ↑ 104.803 → 104.868
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-2-13b-chat: Epoch Capabilities Index (ECI) ↑ 105.772 → 105.834
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-2-13b: Epoch Capabilities Index (ECI) ↑ 105.772 → 105.834
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen-7B: Epoch Capabilities Index (ECI) ↑ 106.422 → 106.482
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
LLaMA-33B: Epoch Capabilities Index (ECI) ↑ 107.029 → 107.087
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Nemotron-4 15B: Epoch Capabilities Index (ECI) ↑ 107.344 → 107.401
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
phi-2: Epoch Capabilities Index (ECI) ↑ 107.535 → 107.59
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
falcon-11b: Epoch Capabilities Index (ECI) ↑ 109.185 → 109.236
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
LLaMA-65B: Epoch Capabilities Index (ECI) ↑ 109.817 → 109.865
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemma-7b: Epoch Capabilities Index (ECI) ↑ 111.66 → 111.702
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
falcon-180B: Epoch Capabilities Index (ECI) ↑ 111.822 → 111.864
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mistral-7B-Instruct-v0.1: Epoch Capabilities Index (ECI) ↑ 111.877 → 111.918
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mistral-7B-v0.1: Epoch Capabilities Index (ECI) ↑ 111.877 → 111.918
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen-14B-Chat: Epoch Capabilities Index (ECI) ↑ 112.703 → 112.742
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen-14B: Epoch Capabilities Index (ECI) ↑ 112.703 → 112.742
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen2.5-Coder-7B-Instruct: Epoch Capabilities Index (ECI) ↑ 112.816 → 112.853
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen2.5-Coder-7B: Epoch Capabilities Index (ECI) ↑ 112.816 → 112.853
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-3.5-turbo-0613: Epoch Capabilities Index (ECI) ↑ 113.032 → 113.07
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-2-70b-chat: Epoch Capabilities Index (ECI) ↑ 113.639 → 113.673
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-2-70b-hf: Epoch Capabilities Index (ECI) ↑ 113.639 → 113.673
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-2-70b-chat-hf: Epoch Capabilities Index (ECI) ↑ 113.639 → 113.673
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-3.5-turbo-0125: Epoch Capabilities Index (ECI) ↑ 114.329 → 114.352
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-3.1-8B-Instruct: Epoch Capabilities Index (ECI) ↑ 115.364 → 115.385
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Meta-Llama-3-8B: Epoch Capabilities Index (ECI) ↑ 116.368 → 116.391
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Meta-Llama-3-8B-Instruct: Epoch Capabilities Index (ECI) ↑ 116.368 → 116.391
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
StableBeluga2: Epoch Capabilities Index (ECI) ↑ 116.837 → 116.862
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemini-1.0-pro-001: Epoch Capabilities Index (ECI) ↑ 116.955 → 116.974
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Phi-3-mini-4k-instruct: Epoch Capabilities Index (ECI) ↑ 117.076 → 117.098
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Yi-34B: Epoch Capabilities Index (ECI) ↑ 117.086 → 117.107
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Yi-34B-Chat: Epoch Capabilities Index (ECI) ↑ 117.086 → 117.107
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-3-haiku-20240307: Epoch Capabilities Index (ECI) ↑ 117.577 → 117.593
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mixtral-8x7B-v0.1: Epoch Capabilities Index (ECI) ↑ 118.078 → 118.095
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
open-mixtral-8x7b: Epoch Capabilities Index (ECI) ↑ 118.078 → 118.095
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mixtral-8x7B-Instruct-v0.1: Epoch Capabilities Index (ECI) ↑ 118.078 → 118.095
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-3.5-turbo-1106: Epoch Capabilities Index (ECI) ↑ 118.233 → 118.248
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-2.1: Epoch Capabilities Index (ECI) ↑ 118.287 → 118.295
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mistral-Nemo-Instruct-2407: Epoch Capabilities Index (ECI) ↑ 118.339 → 118.354
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mistral-Nemo-Base-2407: Epoch Capabilities Index (ECI) ↑ 118.339 → 118.354
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
open-mistral-nemo-2407: Epoch Capabilities Index (ECI) ↑ 118.339 → 118.354
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen2.5-Coder-32B: Epoch Capabilities Index (ECI) ↑ 119.217 → 119.232
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemma-2-9b: Epoch Capabilities Index (ECI) ↑ 119.542 → 119.552
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemma-2-9b-it: Epoch Capabilities Index (ECI) ↑ 119.542 → 119.552
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-2.0: Epoch Capabilities Index (ECI) ↑ 119.546 → 119.56
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-3-sonnet-20240229: Epoch Capabilities Index (ECI) ↑ 119.847 → 119.856
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
mistral-large-2402: Epoch Capabilities Index (ECI) ↑ 120.985 → 120.993
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Phi-3-medium-128k-instruct: Epoch Capabilities Index (ECI) ↑ 121.036 → 121.045
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-instant-1.2: Epoch Capabilities Index (ECI) ↑ 121.068 → 121.078
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-instant-1.1: Epoch Capabilities Index (ECI) ↑ 121.068 → 121.078
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mixtral-8x22B-Instruct-v0.1: Epoch Capabilities Index (ECI) ↑ 121.237 → 121.245
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Mixtral-8x22B-v0.1: Epoch Capabilities Index (ECI) ↑ 121.237 → 121.245
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
open-mixtral-8x22b: Epoch Capabilities Index (ECI) ↑ 121.237 → 121.245
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Phi-3-small-8k-instruct: Epoch Capabilities Index (ECI) ↑ 121.602 → 121.609
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4-0613: Epoch Capabilities Index (ECI) ↑ 121.863 → 121.871
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemma-2-27b-it: Epoch Capabilities Index (ECI) ↑ 122.445 → 122.451
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemini-1.5-flash-0514: Epoch Capabilities Index (ECI) ↑ 122.518 → 122.525
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemini-1.5-flash-001: Epoch Capabilities Index (ECI) ↑ 122.518 → 122.525
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Meta-Llama-3-70B: Epoch Capabilities Index (ECI) ↑ 122.587 → 122.595
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Meta-Llama-3-70B-Instruct: Epoch Capabilities Index (ECI) ↑ 122.587 → 122.595
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
amazon.nova-pro-v1:0: Epoch Capabilities Index (ECI) ↑ 123.856 → 123.857
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
DeepSeek-V2: Epoch Capabilities Index (ECI) ↓ 124.564 → 124.562
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
qwen2-72b-instruct: Epoch Capabilities Index (ECI) ↓ 125.464 → 125.462
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-3.2-90B-Vision-Instruct: Epoch Capabilities Index (ECI) ↑ 125.687 → 125.691
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4-32k-0314: Epoch Capabilities Index (ECI) ↓ 125.918 → 125.913
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4-0314: Epoch Capabilities Index (ECI) ↓ 125.918 → 125.913
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-3-opus-20240229: Epoch Capabilities Index (ECI) ↑ 126.907 → 126.909
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4o-mini-2024-07-18: Epoch Capabilities Index (ECI) ↑ 126.932 → 126.933
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-3.3-70B-Instruct: Epoch Capabilities Index (ECI) ↓ 127.495 → 127.492
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
claude-3-5-haiku-20241022: Epoch Capabilities Index (ECI) ↓ 127.498 → 127.495
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
mistral-large-2407: Epoch Capabilities Index (ECI) ↓ 127.542 → 127.541
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4-turbo-2024-04-09: Epoch Capabilities Index (ECI) ↓ 127.59 → 127.589
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
mistral-small-2503: Epoch Capabilities Index (ECI) ↑ 0 → 127.738
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
mistral-large-2411: Epoch Capabilities Index (ECI) ↓ 128.8 → 128.798
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4o-2024-05-13: Epoch Capabilities Index (ECI) ↓ 128.821 → 128.818
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-3.1-405B: Epoch Capabilities Index (ECI) ↓ 128.994 → 128.988
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-3.1-405B-Instruct: Epoch Capabilities Index (ECI) ↓ 128.994 → 128.988
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen2.5-72B: Epoch Capabilities Index (ECI) ↓ 129.11 → 129.099
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Qwen2.5-VL-72B-Instruct: Epoch Capabilities Index (ECI) ↓ 129.11 → 129.099
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
qwen2.5-72b-instruct: Epoch Capabilities Index (ECI) ↓ 129.11 → 129.099
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4o-2024-08-06: Epoch Capabilities Index (ECI) ↓ 129.171 → 129.167
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4o-2024-11-20: Epoch Capabilities Index (ECI) ↓ 129.277 → 129.275
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemini-1.5-flash-002: Epoch Capabilities Index (ECI) ↓ 130.371 → 130.368
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
chutes/Llama-4-Scout-17B-16E Instruct: Epoch Capabilities Index (ECI) ↓ 130.55 → 130.544
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
Llama-4-Scout-17B-16E-Instruct: Epoch Capabilities Index (ECI) ↓ 130.55 → 130.544
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
grok-2-1212: Epoch Capabilities Index (ECI) ↓ 130.78 → 130.775
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gpt-4.1-nano-2025-04-14: Epoch Capabilities Index (ECI) ↓ 130.868 → 130.861
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
phi-4: Epoch Capabilities Index (ECI) ↓ 131.043 → 131.037
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
chutes/Gemma-3-27b-It: Epoch Capabilities Index (ECI) ↓ 131.074 → 131.067
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot → -
gemma-3-27b-it: Epoch Capabilities Index (ECI) ↓ 131.074 → 131.067
Tracking since Jun 25, 2026 · 5 snapshots on file
Same model id, score moved on Epoch Capabilities Index — a silent re-evaluation or model swap.
Full history & current snapshot →
No changes match these filters.