Will an AI score over 10% on FrontierMath Benchmark in 2025 | Manifold

Basic

19

Ṁ2018

2025

79% chance

"Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%.
Existing math benchmarks like GSM8K and MATH are approaching saturation, with AI models scoring over 90%—partly due to data contamination. FrontierMath significantly raises the bar. Our problems often require hours or even days of effort from expert mathematicians.
We evaluated six leading models, including Claude 3.5 Sonnet, GPT-4o, and Gemini 1.5 Pro. Even with extended thinking time (10,000 tokens), Python access, and the ability to run experiments, success rates remained below 2%—compared to over 90% on traditional benchmarks."

This question is managed and resolved by Manifold.

#Technology

#Technical AI Timelines

#IMO Grand Challenge

Related questions

Will an AI achieve >30% performance on the FrontierMath benchmark before 2026?

28% chance (-39% 1d)

Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?

32% chance (-7% 1d)

Will an AI be capable of achieving a perfect score on the Putnam exam before 2030?

68% chance (-5% 1d)

Which lab's AI will be the first to score over 10% on FrontierMath benchmark?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

Will an AI score over 30% on FrontierMath Benchmark in 2025

28% chance (-8% 1d)

By when will AI score >= 80% on FrontierMath

Will an AI score over 80% on FrontierMath Benchmark in 2025

Will an AI agent system be able to score at least 40% on level 3 tasks in the GAIA benchmark before 2025.
