Who will release the next generation-defining LLM? | Manifold

Who will release the next generation-defining LLM?

Technology AI GPT-5

34

Ṁ1.5k

2025

OpenAI: 53%

Meta: 13%

Anthropic: 40%

Google/Alphabet: 10%

Mistral: 11%

Other: 15%

Who will release the next Large Language Model with an LMSYS Arena Elo of at least 1334, i.e. 75 points above the current leader?

As of 22 April 2024, four models have an Arena Elo between 1249 and 1259 on the LMSYS leaderboard: three versions of GPT-4 and one version of Claude 3 Opus. The highest-rated GPT-3.5-Turbo version has an Elo of 1119, 46 points behind the lowest-rated GPT-4 version (both 0613), while the 0314 versions of these models are 82 points apart. A 75-point gap would therefore represent a breakthrough and a new generation of LLMs.
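The arithmetic behind the 1334 threshold can be checked with a short sketch. The Elo figures are the ones quoted in the description. The GPT-4-0613 rating is inferred from the stated 46-point gap rather than quoted directly, and `expected_win_rate` uses the standard Elo expected-score formula (base 10, scale 400), which is assumed here to be a fair reading of the Arena scale.

```python
# Figures quoted in the market description (LMSYS leaderboard, 22 April 2024).
CURRENT_LEADER_ELO = 1259  # top of the 1249-1259 cluster
REQUIRED_GAP = 75          # gap chosen by the market creator

GPT35_TURBO_0613 = 1119
GAP_0613 = 46              # GPT-4-0613 minus GPT-3.5-Turbo-0613
GAP_0314 = 82              # GPT-4-0314 minus GPT-3.5-Turbo-0314


def expected_win_rate(elo_gap: float) -> float:
    """Standard Elo expected score for the higher-rated side (assumed scale)."""
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / 400.0))


threshold = CURRENT_LEADER_ELO + REQUIRED_GAP
gpt4_0613 = GPT35_TURBO_0613 + GAP_0613  # inferred, not quoted in the text

print(threshold)                                  # 1334
print(gpt4_0613)                                  # 1165
print(GAP_0613 < REQUIRED_GAP < GAP_0314)         # True
print(round(expected_win_rate(REQUIRED_GAP), 3))  # 0.606
```

Under that assumption, a model 75 points ahead would be expected to win roughly 61% of head-to-head comparisons against the current leader.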

Elo will be evaluated 1 week after the model enters the leaderboard. If 1334 is within the top contender's confidence interval, I'll wait 1 additional week and resolve based on the Elo then. If multiple models meet the criteria, the one with the earliest release date wins.
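One way to read the resolution rule above is sketched below. The `Candidate` fields, the `qualifies` and `resolve` helpers, and all example values are hypothetical and only illustrate the logic. In particular, holding the market open while an earlier release is still waiting for its second-week rating is an assumption, not something the description states.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional

THRESHOLD = 1334  # from the market description


@dataclass
class Candidate:
    company: str
    model: str
    release_date: date
    elo_week1: float                   # rating 1 week after entering the leaderboard
    ci_low: float                      # confidence interval at week 1
    ci_high: float
    elo_week2: Optional[float] = None  # only needed when week 1 is ambiguous


def qualifies(c: Candidate, threshold: float = THRESHOLD) -> Optional[bool]:
    """Return True/False once decided, or None while waiting for week 2."""
    if c.ci_low <= threshold <= c.ci_high:
        if c.elo_week2 is None:
            return None
        return c.elo_week2 >= threshold
    return c.elo_week1 >= threshold


def resolve(candidates: List[Candidate], threshold: float = THRESHOLD) -> Optional[str]:
    """Return the winning company, or None if nothing can be resolved yet."""
    # Walk through models in release order so the earliest qualifier wins.
    for c in sorted(candidates, key=lambda c: c.release_date):
        verdict = qualifies(c, threshold)
        if verdict is None:
            return None  # assumption: wait for the earlier release to settle
        if verdict:
            return c.company
    return None


# Made-up example data, not real leaderboard results.
example = [
    Candidate("Company B", "Model B", date(2024, 9, 1), 1350, 1341, 1359),
    Candidate("Company A", "Model A", date(2024, 8, 1), 1336, 1328, 1344, elo_week2=1338),
]
print(resolve(example))  # Company A
```

Here both hypothetical models clear the threshold, and the earlier release wins even though the later one has the higher Elo.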

https://chat.lmsys.org/?leaderboard

Related questions

Who will have the best LLM at the end of 2024 (as decided by ChatBot Arena)?

When will the next generation-defining LLM be released?

Will Apple release its own LLM on par with state of the art LLMs before 2026?

At the end of 2024, what type of LLM prompt will be the most popularized?

Who will be ahead in the AI/LLM war by the end of 2024?

Will Google have the best LLM by EOY 2024?

Which company will have the best LLM by the end of 2024?

Will the most interesting AI in 2027 be a LLM?

Will the leading LLM at the beginning of 2026 still be subject to the reversal curse?

Will the most advanced LLM stop being from a US-based company any time before 2030?

bought Ṁ15 OpenAI NO

Claude 3.5 could be a contender here

👀👀

https://x.com/LiamFedus/status/1790064963966370209

@Mactuary Well, GPT-4o is on the leaderboard and well below the Elo needed to resolve, so I think we're still waiting for GPT-5 or Llama 3 400b or...

Related market on when the model will be released

So only one of the options will resolve YES?

@Sss19971997 That's right. If two models meeting the criteria are released in close succession, the one that was released first will be the winner (not the one that has the higher Elo).

Related in Technology

Good Tweet or Bad Tweet? Which controversial posts will Manifold think are a "Good Take" this week?

Will GPT-5 be released before 2025?

Related in AI

In early 2028, will an AI be able to generate a full high-quality movie to a prompt?

Will a large language model beat a super grandmaster playing chess by 2028?

Related in GPT-5

Will GPT-5 be released before Aug 2024?

Will OpenAI open source the weights to one of the GPT family models in 2024?
