Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Coding Evaluation | Manifold

Oct 1

28% chance

Context:

Gemini-1.5-Pro-Exp-0801 is currently the leading model on the LMSYS Arena leaderboard (https://arena.lmsys.org/).
This market is about its potential evaluation by Scale AI (https://scale.com/leaderboard).

Resolution Criteria:

The market resolves as "Yes" if the model is evaluated by Scale AI and it receives a score strictly greater than 1165 in the Coding category.
The market resolves as "No" if the model is evaluated by Scale AI and it receives a score of 1165 or less in the Coding category.
The market resolves as "N/A" if either:
1. Scale AI doesn't evaluate the model and add it to the leaderboard before October 1, 2024, or
2. The evaluation methodology changes before the model is evaluated.
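
The criteria above can be sketched as a small decision function. This is only an illustration: the function name, parameter names, and the choice to check the "N/A" conditions before comparing the score are assumptions, not something the market text or Manifold specifies.

```python
from typing import Optional

# Threshold taken from the resolution criteria above.
CODING_THRESHOLD = 1165


def resolve_market(
    coding_score: Optional[float],
    evaluated_before_deadline: bool,
    methodology_changed: bool,
) -> str:
    """Return "Yes", "No", or "N/A" under the stated criteria.

    All parameter names are hypothetical. `evaluated_before_deadline` means
    Scale AI evaluated the model and added it to the leaderboard before
    October 1, 2024. Checking the N/A conditions first is an assumption.
    """
    if not evaluated_before_deadline or methodology_changed:
        return "N/A"
    if coding_score is None:
        # No score to compare against; treated as not evaluated (assumption).
        return "N/A"
    # "Yes" requires a score strictly greater than the threshold.
    return "Yes" if coding_score > CODING_THRESHOLD else "No"
```

Note that a score of exactly 1165 resolves as "No" because the comparison is strict.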

This question is managed and resolved by Manifold.

#Chatbot Arena Leaderboard

I can't find "N/A" option

Related questions

Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Math Evaluation

Will Gemini achieve a higher score on the SAT compared to GPT-4?

Will Gemini exceed the performance of GPT-4 on the 2022 AMC 10 and AMC 12 exams?

Will Gemini 1.5 Pro seem to be as good as Gemini 1.0 Ultra for common use cases? [Poll]

Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on Simple Bench?

Will Gemini-1.5-Pro-Exp-0801 Score Above 90.35 (current #1) in Scale AI's Instruction Following Evaluation

Will Gemini-1.5-Pro-Exp-0801 Score Lower Than 8 (current best) in Scale AI's Adversarial Robustness

Will Gemini outperform GPT-4 at mathematical theorem-proving?

Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?

Will Gemini Ultra outperform GPT-4V on visual reasoning by the end of 2024?
