Will OpenAI o1 (or any direct iteration like o3) get gold on any International Math Olympiad by the end of 2025?

132

11kṀ82k

Dec 31

29%

chance

ALL

Resolves similar to this market: https://manifold.markets/Austin/will-an-ai-get-gold-on-any-internat, but the AI model has to be OpenAI o1 or a direct iteration of o1.

The market will resolve to YES if:
- The linked market resolves YES because of OpenAI o1 (or any direct iteration)
- OpenAI o1 (or any direct iteration) accomplishes a comparable feat as the model that caused the linked market to resolve YES

Note: By "direct iteration," this market refers to any model explicitly branded as a continuation or update of OpenAI o1, such as a model named "o2" or "o3", rather than a distinct, separately branded model.

Technical AI Timelines

OpenAI

Math

IMO Grand Challenge

Get

1,000

to start trading!

People are also trading

AI IMO 2025: How many AI labs announce a Gold performance at the IMO in 2025?

Will an AI get bronze or silver on any International Math Olympiad by end of 2025?

77% chance

Will an AI get bronze on any International Math Olympiad by 2025?

88% chance

Will an AI get silver on any International Math Olympiad by 2025?

86% chance

What will be the first year that an AI gets gold on an International Mathematical Olympiad?

Will an AI publicly accessible before IMO 2025 get the gold medal on it?

22% chance

Will an AI get at least silver on International Olympiad in Informatics (IOI) by end of 2025?

99% chance

Will AI get at least bronze on the International Olympiad in Informatics (IOI) before 2026?

96% chance

Will the AIMO (Artificial Intelligence Mathematical Olympiad) Grand Prize be won before 2026?

Sort by:

bought Ṁ3,000 NO

https://arxiv.org/pdf/2503.21934v1

Sama has stated that "a top goal for us is to unify o-series models and GPT-series models", and "In both ChatGPT and our API, we will release GPT-5 as a system that integrates a lot of our technology, including o3. We will no longer ship o3 as a standalone model.".

Given this, will GPT-5 count as a "direct iteration" of o1 in this market?

@MaxMusing Based on Sama's statement, GPT-5 appears to be a 'distinct, separately branded model' rather than a direct iteration of o1. However, I'll wait for OpenAI's official announcement of GPT-5 before making a final decision, particularly regarding what they mean by 'integrate.'

We need to start including cost in these markets.

o1-preview is incredible, but it's useless because it's ridiculously expensive and limited. If we keep basing progress off of models that are only usable by a select few, then it's not really progress.

If the model is price competitive with, for example, software engineers, it is really progress. and they seem to be? does it cost more than 50$ per hour of "thinking"? The cost can also go down a lot from here.

@Bayesian Even if it's cost-competitive, there are other issues.

It has to be open source. You can't build a business around a model owned by a company when the law allows that company to terminate service to you for any time for any reason. I stopped all use of E-Mail marketing firms for that reason, because they terminated service to me.
The model has to actually be available, and o1-preview is not. The world is clearly limited by manufacturing capacity and will be for some time. I would pay $1000/month to use the model but nobody will offer me that because the GPUs don't exist.

These models are clearly "AGI," without a doubt. But I've always thought that @EliezerYudkowsky 's ideas that the world is suddenly going to change overnight were ridiculous. AGI hasn't, and won't, change the world for some time because intelligence is not our limiting factor.

actually you can. Lots of software is not opensource and is used by businesses all of the time
How is o1-preview not available?

@SteveSokolowski your ideas are unreasonable and do not represent real world

@Bayesian It is true that businesses build software around a single business. It's also a really poor idea to do so. If you build an entire business solely around OpenAI's products, you are out of business if OpenAI bans you from the service, even if there isn't a good reason for them to do so.

@SteveSokolowski o1-preview is priced similarly to the original davinci model. Should we consider that ridiculously expensive?

The market makes no comment on progress, availability, or open-source. Confused as to how your points should affect the price.

@SteveSokolowski Given that cost goes down so rapidly (around a factor of 4 per year) I think that is not particularly meaningful to think about it much.

>- OpenAI o1 (or any direct iteration) accomplishes a comparable feat as the model that caused the linked market to resolve YES

What do you mean by "comparable feat"? Does it have to solve in the same time limit human competitors get?

@mongo Will resolve similarly to the linked market, so probably with time limit

So if a model called o2 gets it, this resolves no?

@ChrisPrichard It should still resolve Yes.
The original description included "future iteration". I changed it slightly to make it more explicit that the new iteration does not have to be called o1 (but has to be directly based on o1).

bought Ṁ500 YES

@Simon74fe Then it seems much more likely to me! Basically, will something that uses a lot of test time inference succeed. It doesn't need to be the first to do it to resolve yes, right? Like maybe some specialized model does it first and then o3 with gpt-5 backing it can also do it and this resolves yes.

opened a Ṁ1,000 NO at 32% order

Given the title says "o1" this is very confusing. And "directly based on" - what does that mean, if OpenAI builds GPT 5 which iterates on o1 does that count too?

Updated the description again and included it in the title. GPT-6 would most likely not count even if it uses some insights from o1, unless it is explicitly marketed as a direct iteration of o1.