Which AI will be the best at the end of 2025?
Basic
46
8.5k
2026
43%
GPT - OpenAI
37%
Claude - Anthropic
8%
Gemini - Google
5%
Other
3%
Yi - 01.AI
1.3%
Grok - xAI

Resolves to the exact same result as Kalshi's equivalent 2025 market if it exists, which uses lmsys chatbot leaderboard with other rules. Except if two models tie we will resolve them both 50 percent.

If not will use the ruleset of the 2024 market (linked below)

https://kalshi.com/markets/llm1/yearend-top-llm

Rules Summary (2024 version)

If OpenAI has the top-ranked LLM on Dec 31, 2024, then that market resolves to Yes. Outcome verified from LMSYS.

A tie would resolve to No.

Clarification 3/14/24 6:03 PM ET: The Contract's Underlying states that, "The Underlying for this Contract is the Arena Elo rankings of large language models on the LMSYS Chatbot Arena Leaderboard as checked at 10:00 AM ET daily after Issuance and before ." To be clear, this refers to the "rank" column of the leaderboard, not the Arena Elo Score proper. As of this update, the #1 spot is tied between two variations of GPT-4 and Claude 3 Opus. Moreover, the Payout Criterion states that, "If they [the target organization] have an LLM that is tied with another LLM, then the Payout Criterion is not fulfilled." To be clear, if the only tie is with the same organization, that organization's strike would resolve to Yes. So if on the target date the tie is between two variations of GPT-4, then the market would resolve Yes in favor of OpenAI. However, if it is tied between GPT-4 and Claude 3, then both OpenAI and Anthropic's strikes would resolve to No.

Get Ṁ600 play money
Sort by:

claude took the lead here but not kalshi

opened a Ṁ1,000 Claude - Anthropic YES at 20% order

So does the Tie condition resolve to “other”?

Will resolve the two tied winners as 50 percent each if that happens or use kalshis ruling

The excerpt you cited says:

the Payout Criterion states that, "If they [the target organization] have an LLM that is tied with another LLM, then the Payout Criterion is not fulfilled."”

So if they keep it the same, which are you going to do?

I'll resolve both to 50 in our case then

bought Ṁ50 GPT - OpenAI YES

upgraded