Will an LLM be able to solve confusing but elementary geometric reasoning problems in 2024? (strict LLM version)
➕
Plus
29
Ṁ19k
Jan 1
14%
chance

This is a variant of the following market:

https://manifold.markets/dreev/will-an-llm-be-able-to-solve-confus

In this version, the problem has to be solved purely by the LLM itself.

Get
Ṁ1,000
and
S3.00
Sort by:

Sanity check: if GPT-o1 were to pull this off in time, that still counts as a strict LLM, right?

@dreev i think so

bought Ṁ250 NO

@dreev this will be increasingly challenging as more and more models are integrated into a single system, which is in part why I don't bet much on the "Will LLMs do [task] by [future year]?" markets, but yeah, I think it's reasonable to call GPT-o1 an LLM for the rest of 2024.

@Jacy Thank you, that makes a ton of sense. I shall avoid trying to single out LLMs in the future and hope that this one won't turn out too painful to adjudicate over the remaining 3 months in 2024. If anyone has any counterpoints about GPT-o1, chime in! (Not that it matters so far, with GPT-o1 failing our flagship geometric reasoning problem so far, but it does seem to be getting closer... 😬)

© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules