Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
Standard
13
Ṁ803resolved Sep 16
Resolved
YES1D
1W
1M
ALL
Resolve to YES if OpenAI's next generation language model scores 65% or higher on the GPQA benchmark(extended set).
If OpenAI's existing model gets 65% or higher by post-training enhancements, that also counts.
There's room for improvement via prompt engineering after the release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases their next model.
Get
1,000
and1.00
Related questions
Related questions
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
66% chance
Will there be an AI language model that strongly surpasses ChatGPT and other OpenAI models before the end of 2024?
18% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
81% chance
By the end of Q1 2025 will an open source model beat OpenAI’s o1 model?
30% chance
Will there be a model that has a 75% win rate against the latest iteration of GPT-4 as of January 1st, 2025?
63% chance
By the end of Q2 2025 will an open source model beat OpenAI’s o1 model?
53% chance
Will OpenAI be in the lead in the AGI race end of 2026?
44% chance
Will OpenAI launch a significantly better model for ChatGPT paying users in 2024? (>= 100 points diff on ChatBot Arena)
19% chance
Will OpenAI release GPT-5 before the end of 2024?
23% chance
Will the gap between open-weights and frontier models on GPQA be at most 7%?
52% chance