Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?

13

1kṀ803

resolved Sep 16

Resolved

YES

1D

1W

1M

ALL

Resolve to YES if OpenAI's next generation language model scores 65% or higher on the GPQA benchmark(extended set).

If OpenAI's existing model gets 65% or higher by post-training enhancements, that also counts.

There's room for improvement via prompt engineering after the release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases their next model.

GPT-5 Capabilities

Technical AI Timelines

Get

1,000

to start trading!

🏅 Top traders

#	Name	Total profit
1		Ṁ168
2		Ṁ68
3		Ṁ23
4		Ṁ22
5		Ṁ11

Related questions

Will OpenAI be in the lead in the AGI race end of 2026?

Will OpenAI's o4 get above 50% on humanity's last exam?

OpenAI's next major AI model will be more open than GPT-4 by June 30, 2025

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?

+4% 1d23% chance

Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?

Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?

Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?

Will AI image generating models score >= 90% on Winoground by June 1, 2025?

Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?

Next model open sourced by OpenAI?

Related questions

Will OpenAI be in the lead in the AGI race end of 2026?

Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?

Will OpenAI's o4 get above 50% on humanity's last exam?

Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?

OpenAI's next major AI model will be more open than GPT-4 by June 30, 2025

Will AI image generating models score >= 90% on Winoground by June 1, 2025?

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?

Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?

Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?

Next model open sourced by OpenAI?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules