Will an LLM break 1400 ELO on LMSys before February?
Premium
9
Ṁ2996Feb 2
54%
chance
1D
1W
1M
ALL
Google currently leads with Gemini -- which has two models at around 1370
But OpenAI just announced O3 -- which is getting great marks on things like hard science questions.
https://deepnewz.com/ai-modeling/openai-unveils-o3-o3-mini-models-exceeding-human-performance-on-arc-agi-4f05e4f7
The resolution is simple. Will and LMSys update contain a model with 1400 ELO? Cutoff is last day in January (East Coast time).
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Sort by:
Related questions
Related questions
Will any LLM outrank GPT-4 by 150 Elo in LMSYS chatbot arena before 2025?
6% chance
Will AIs stay below 1453 elo in 2024 on chat.lmsys.org/?leaderboard as predicted by Gary Marcus?
98% chance
What organization will have the highest ELO score in the LMSYS Org Chatbot Arena Leaderboard at the end of Dec, 2024?
Which organization will have the top LLM on LMSys on March 1st?
Will an LLM be able to solve a Rubik's Cube by 2025?
25% chance
Llama 3 405B ELO on Lmsys Arena Leaderboard 2 weeks after first appearance?
EOY 2025: Will open LLMs match closed-source LLMs on coding to within 50 ELO points?
36% chance
Will LLMs mostly overcome the Reversal Curse by the end of 2025?
67% chance
Will another LLM prompting challenge with prizes worth $10k+ be completed by the end of 2024?
17% chance
Can an LLM beat me in chess?
10% chance