In 2024, will METR or Google announce the results of a METR eval on a Google LLM? | Manifold

In 2024, will METR or Google announce the results of a METR eval on a Google LLM?

Basic

7

Ṁ189

Jan 1

72%

chance

1D

1W

1M

ALL

METR = formerly ARC Evals (https://metr.org/)

if METR/Google reorgs and has a clear successor org, that org also applies for the purpose of this market

central YES cases:

if Google releases a model card with something like Gemini Supermega 2024 edition with METR exfiltration eval results, like OpenAI did for GPT4 technical report

does not have to be the specific exfiltration eval.

does not have to be included in initial model release paper. does not have to be specifically in a paper.

does not have to be any specific eval granularity. "METR ran the eval and it was all OK" would be ... annoyingly vague from whoever would write it, but it would count.

has to be confirmed-ish by Google and/or METR. can't be just a Twitter rumor.

This question is managed and resolved by Manifold.

#Technical AI Timelines

Get

1,000

and

3.00

Sort by:

I am not aware of this having happened. I'll sometime soon do a little bit of search and resolve from what I find. But if anyone has evidence and can save me some work, I'd appreciate.

Related questions

Will Google cancel an LLM-based product by end of 2025?

Will the new LLM released by Meta be open-source?

Will there be an LLM which scores above what a human can do in 2 hours on METR's eval suite before 2026?

Will YouTube Comments make it into a major LLM by EOY 2027?

Will a major technology company publicly admit to using a LLM for important decision making before 2025?

Will Google announce that it's going to power Google Translate with an LLM-only based system (before 2024 end)?

[Metaculus] Will Google implement a feature to explain targeted Google Ads before 2026?

In what year will Google or Meta release a commercial facial recognition search product?

Will Meta AI's MEGABYTE architecture be used in the next-gen LLMs?

Will there be a clear way to integrate LLMs with ads by the end of 2024?

Related questions

Will Google cancel an LLM-based product by end of 2025?

Will Google announce that it's going to power Google Translate with an LLM-only based system (before 2024 end)?

Will the new LLM released by Meta be open-source?

[Metaculus] Will Google implement a feature to explain targeted Google Ads before 2026?

Will there be an LLM which scores above what a human can do in 2 hours on METR's eval suite before 2026?

In what year will Google or Meta release a commercial facial recognition search product?

Will YouTube Comments make it into a major LLM by EOY 2027?

Will Meta AI's MEGABYTE architecture be used in the next-gen LLMs?

Will a major technology company publicly admit to using a LLM for important decision making before 2025?

Will there be a clear way to integrate LLMs with ads by the end of 2024?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules