Highest Epoch-acknowledged FrontierMath score at EOY2026?
7
3kṀ5201
2026
52.1 %
expected
4%
10 - 19.99%
17%
20 - 29.99%
16%
30 - 39.99%
15%
40 - 49.99%
9%
50 - 59.99%
12%
60 - 69.99%
12%
70 - 79.99%
9%
80 - 89.99%
6%
90 - 100%

While OpenAI has claimed that o3-mini achieved 32% on FrontierMath, I don't really believe them, plus they used an ungodly amount of compute.

When judging how much progress has been made on FrontierMath, I prefer to defer to Epoch. The highest Epoch-validated FrontierMath score is o3-mini-high, with 11%.

At end-of-year 2026, what will be the highest performance on FrontierMath, according to Epoch? To resolve this, I will use their AI Benchmarking Hub, or -- if that page becomes out of date -- whatever I consider the authoritative Epoch source on FrontierMath to be.

It seems plausible that Epoch will give different numbers depending on amount of compute, scaffolding, etc. If so, I will resolve this to the highest number claimed by Epoch -- though note that a number only counts if it was validated by Epoch. If Epoch lists self-reported numbers from a lab that it has not validated, then those numbers do not count for the resolution of this market.

Get
Ṁ1,000
to start trading!
Sort by:
bought Ṁ50 YES

o3 and o4-mini were added to the official results yesterday. o3 scored 10%, and o4-mini scored 17%.

© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules