Will Google Gemini perform better (text) than GPT-4? | Manifold

Will Google Gemini perform better (text) than GPT-4?

Plus

46

Ṁ6208

Dec 31

55%

chance

1D

1W

1M

ALL

"perform better" refers to the text performance only, to keep it simple. To be comparable the performance should be equal or extremely close across a wide range of benchmarks (e.g. MMLU, HumanEval, WinoGrande) and chat/agent tests (e.g. MT-Bench). It should also have at least 8k context length (chosen since GPT-4 has 8k and 32k context length versions).

Of course, to qualify as YES, the group that develops a competitor must publicly announce that they trained an LLM with the benchmark results, or make an API available to external evaluators. If Gemini is released exclusively through a chat interface and the only benchmarks are internal to Google, then this market will resolve N/A because of a lack of sufficient information.

Market will resolve as soon as we can get accurate evaluations for Gemini after it releases. The only situation in which this market should make it to its end date is if Gemini is not released to external evaluators by EOY 2024.

GPT-4's reference results will be the GPT-4 API at the time of Gemini evaluation (i.e. same month). If GPT-4.5 releases, this will not be considered.

This question is managed and resolved by Manifold.

#GPT-5 Speculation

#GPT-4 speculation

Get

1,000

and

3.00

Sort by:

predictedYES

How are you going to handle the multi size release?

If we go by their word,ultra beats gpt-4,but it isn't publicly available....

predictedNO

@array_wake I suggest wait for ultra

Related questions

Will Gemini achieve a higher score on the SAT compared to GPT-4?

Will Gemini exceed the performance of GPT-4 on the 2022 AMC 10 and AMC 12 exams?

Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?

Will the Google Gemini App have as many installs as the ChatGPT app on Android by end of 2024?

Will YouTube comments be part of GPT5 *or* Google Gemini's latest version by EOY 2024?

Will Gemini outperform GPT-4 at mathematical theorem-proving?

Will an open source model beat GPT-4 in 2024?

Will Gemini Ultra outperform GPT-4V on visual reasoning by the end of 2024?

Will Google Gemini do as well as GPT-4 on Sparks of AGI tasks?

Did Google intentionally announce Gemini Ultra in a state barely outperforming GPT-4 to slow the capabilities race?

Related questions

Will Gemini achieve a higher score on the SAT compared to GPT-4?

Will Gemini outperform GPT-4 at mathematical theorem-proving?

Will Gemini exceed the performance of GPT-4 on the 2022 AMC 10 and AMC 12 exams?

Will an open source model beat GPT-4 in 2024?

Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?

Will Gemini Ultra outperform GPT-4V on visual reasoning by the end of 2024?

Will the Google Gemini App have as many installs as the ChatGPT app on Android by end of 2024?

Will Google Gemini do as well as GPT-4 on Sparks of AGI tasks?

Will YouTube comments be part of GPT5 *or* Google Gemini's latest version by EOY 2024?

Did Google intentionally announce Gemini Ultra in a state barely outperforming GPT-4 to slow the capabilities race?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules