Will a Large Language Model beat me at chess this year? | Manifold

Jan 1

3% chance

I’m rated around 1900 FIDE. At the end of 2024, I’ll play a game at a rapid time control against an LLM selected from the top 3 of the Chatbot Arena leaderboard (https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard). Resolves YES if I lose, NO if I win, and 50% for a draw.

This question is managed and resolved by Manifold.

#Technical AI Timelines

we can only hope.

What prompt will you be using? I imagine that changes their performance quite a bit

Good point! On each move, I’ll give it the moves played so far in PGN, as well as the current position in FEN. That way both representations of the position will be in context, each in a standard format.
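To make the per-move prompt concrete, here is a minimal sketch of how the PGN move list and the FEN string could be combined into one message. The function names `format_movetext` and `build_prompt`, and the exact prompt wording, are assumptions for illustration; the wording actually used in the game is not specified in this thread.

```python
def format_movetext(san_moves):
    """Number a flat list of SAN moves as PGN movetext, e.g. '1. e4 e5 2. Nf3'."""
    parts = []
    for i, move in enumerate(san_moves):
        if i % 2 == 0:
            # White's move starts a new move number.
            parts.append(f"{i // 2 + 1}.")
        parts.append(move)
    return " ".join(parts)


def build_prompt(san_moves, fen):
    """Build one prompt holding both the PGN movetext and the FEN position.

    The prompt text is hypothetical; only the idea of supplying both
    representations comes from the discussion above.
    """
    movetext = format_movetext(san_moves) or "(no moves yet)"
    return (
        "We are playing a game of chess.\n"
        f"Moves so far (PGN): {movetext}\n"
        f"Current position (FEN): {fen}\n"
        "Reply with your next move in standard algebraic notation."
    )


if __name__ == "__main__":
    # Position after 1. e4 e5 2. Nf3, Black to move.
    fen = "rnbqkbnr/pppp1ppp/8/4p3/4P3/5N2/PPPP1PPP/RNBQKB1R b KQkq - 1 2"
    print(build_prompt(["e4", "e5", "Nf3"], fen))
```

Both strings can be copied from the lichess analysis board after each move, so nothing here needs a chess library.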

I think that makes the model significantly worse than it could otherwise be. I'd recommend using whatever prompt was devised by someone claiming "SOTA LLM chess" or similar.

I’m planning to use lichess to play the game, and those are the representations it provides. In a future market this might change.

bought Ṁ43 NO

When I tested this with ChatGPT 3.0 a while back, it couldn't even remember the board position and kept making illegal moves. How will you resolve if it does this?

Let’s say three illegal moves will result in a loss. Distinctions like Rad1 vs. Rd1 won’t count towards this, but I’ll ask it for clarification.
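The illegal-move rule can be sketched as a small counter. This is only an illustration under two assumptions not settled in the thread: that the three illegal moves are counted cumulatively over the game rather than consecutively, and that each reply has already been classified by hand as `"legal"`, `"ambiguous"` (e.g. Rd1 where Rad1 was needed), or `"illegal"`. The class name `StrikeTracker` and the returned action strings are hypothetical.

```python
from dataclasses import dataclass

MAX_ILLEGAL = 3  # three illegal moves result in a loss for the LLM


@dataclass
class StrikeTracker:
    illegal_count: int = 0

    def record(self, outcome):
        """Record one classified reply and return the action to take.

        Returns 'continue', 'clarify', 'retry', or 'forfeit'.
        """
        if outcome == "legal":
            return "continue"
        if outcome == "ambiguous":
            # Disambiguation issues do not count as a strike;
            # the model is simply asked which piece it meant.
            return "clarify"
        if outcome == "illegal":
            self.illegal_count += 1
            if self.illegal_count >= MAX_ILLEGAL:
                return "forfeit"
            return "retry"
        raise ValueError(f"unknown outcome: {outcome!r}")


if __name__ == "__main__":
    tracker = StrikeTracker()
    for reply in ["legal", "ambiguous", "illegal", "illegal", "illegal"]:
        print(reply, "->", tracker.record(reply))
```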

Related questions

Will a large language model beat a super grandmaster playing chess by 2028?

Will a Language Model under 10B parameters play chess at Grandmaster level by 2050?

Will an LLM (a GPT-like text AI) defeat the World Champion at Chess before 2035?

How big will Mistral's known largest language model be? (2024)

Will any OpenAI model win a chess match against IM by the end of 2024?

When will a Large Language Model beat me at chess?

Will an Open Source LLM Surpass any GPT-4 model in Elo Rating on Chatbot Arena on December 31, 2024?

Will an AI by OpenAI beat a super grandmaster playing chess by 2028?

By 2025 will there be a competitive large language model with >50% of the total training data generated from a large language model?

Will a Large Language Model prove an important math theorem by end of 2024?
