Will I judge GPT-5 to be smarter than o1 (not preview) after both are released?
Plus
14
Ṁ11552026
77%
chance
1D
1W
1M
ALL
Resolves subjectively, based on my analysis of benchmarks both official, third party, and my own.
Some examples of benchmarks I consider are MMLU, ZebraLogic, SWE-bench, simplebench, ARC, and livebench.
Some of my own evals are game-playing (tic-tac-toe, and connect 4), and creative writing (giving a model 3 random nouns and asking it to write a story involving them)
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Sort by:
What do you do if no model named GPT 5 will be released, but instead they continue with the oN scheme for all their models?
@yetforever Resolves n/a. Though a departed researcher already described working on GPT-5 so I would be surprised if that happened
Related questions
Related questions
Will GPT-5 be released before 2025?
5% chance
What will be true about GPT-5?
Will there be a GPT-4.5 model before GPT-5 is released?
16% chance
Which Benchmarks will GPT-5 be benchmarked against, when it is announced?
Will it be revealed that GPT-5 was used for how GPT-5 will be released?
66% chance
What will be true about GPT-5? (See description)
When GPT-5 comes out, will more Manifold users say it exceeded or fell below expectations?
Will GPT-5 be more competent than me in my area of expertise?
44% chance
Will GPT-5 be released before Apr 2025?
32% chance
Will GPT-5 perform better than o1 (not preview) at AIME 2024, Codeforces elo, GPQA, or the 2024 ioi?
66% chance