Sudden jump in AI long-horizon capabilities (1-4 hr version) | Manifold

2028

65% chance

After an AI achieves >50% performance on 15-60 minute tasks, will it take less than one year for an AI to achieve >50% performance on 1-4 hour tasks?

By default, resolution will rely on reporting from OpenAI, METR, or other large AI organizations. If a compelling third-party scaffolding demonstration reports on this first, I will accept it if I am >90% confident that its results are accurate. The results need not use SWE-bench or METR's pre-existing dataset; if, e.g., a model resolves this question on Metaculus, that would obviously be sufficient. Agent/assistant tasks and code tasks both count here: if either shows a sub-1-year jump, this resolves Yes. I will not predict on this question.
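The resolution rule above can be sketched as a small check. This is only an illustration, not the market creator's procedure: the function names, the category labels, the dates in the usage note, and the reading of "one year" as 365 days are all assumptions introduced here.

```python
from datetime import date, timedelta
from typing import Dict, Optional, Tuple

# Assumption: "less than one year" is read as strictly fewer than 365 days.
ONE_YEAR = timedelta(days=365)

# Hypothetical structure: for each task category, the date an AI first
# exceeded 50% on 15-60 minute tasks and the date one first exceeded 50%
# on 1-4 hour tasks (None if that milestone has not been reported).
Milestones = Dict[str, Tuple[Optional[date], Optional[date]]]


def category_shows_jump(short_date: Optional[date], long_date: Optional[date]) -> bool:
    """True if both milestones are known and the gap is under one year."""
    if short_date is None or long_date is None:
        return False
    return long_date - short_date < ONE_YEAR


def resolves_yes(milestones: Milestones) -> bool:
    """Yes if any single category (e.g. agent or code) shows a sub-1-year jump."""
    return any(category_shows_jump(s, l) for s, l in milestones.values())
```

For example, with made-up dates, `resolves_yes({"code": (date(2025, 1, 1), date(2025, 9, 1)), "agent": (date(2025, 3, 1), None)})` evaluates to `True`, because the code category alone closes the gap in under a year.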

Background: As of mid-2024, models are often far more efficient than humans at <15 minute tasks. However, for >15 minute tasks models remain highly inconsistent.

https://metr.org/blog/2024-08-06-update-on-evaluations/

https://openai.com/index/introducing-swe-bench-verified/

Technical AI Timelines

I'm open to suggestions on this question's resolution criteria for a month; after that, I'll try to keep revisions minimal.

Related questions

In 2 years, what will the Metaculus prediction for AGI timelines be?

Will software-side AI scaling appear to be suddenly discontinuous before 2025?

Are we about to hit another AI winter in 2025?

Which AI future will we get?

100GW AI training run before 2031?

Will we have better-than-human-aggregate forecasting AIs by the end of 2024?

Will the AI Safety Clock reach 19 minutes to midnight by July 2025?

Will AGI cause US nominal GDP to at least double between 2025 to 2035?

Will AI lead to an S-risk by 2100?

What will be the top-3 AI tools in 2040?
