🐕 Will A.I. Achieve Significantly Higher Performance Over "General Conceptual Skills" by end of 2024? | Manifold

🐕 Will A.I. Achieve Significantly Higher Performance Over "General Conceptual Skills" by end of 2024?

9

150Ṁ215

Jan 1

25%

chance

1D

1W

1M

ALL

Continuation of:

https://manifold.markets/PatrickDelaney/will-ai-achieve-significantly-highe

I reserve the right to change the metrics if they have grown stale in the above. Aim to get this finalized by end of January 2024.

Technical AI Timelines

Third Party Validated, Predictive Markets: AI

Get

1,000

to start trading!

Sort by:

predictedYES

The currently available flagship models (PaLM 2, GPT-4, and Gemini Pro) have not yet been evaluated. As far as I can tell, the largest model is the original PaLM, not PaLM 2. Additionally, it is GPT-3, not GPT-4V which is being evaluated. You can verify this in their published paper.

This is because GPT-4 stated in their technical report that they are not evaluating using BIG-Bench because "portions of BIG-Bench were inadvertently mixed into the training set..." (pg. 6).

Given that the question is trying to gauge whether advances in AI this year are significantly higher with respect to "general conceptual skills", I would argue we need a new metric which includes the current state of the art models.

I don't think you can fairly resolve this market by carrying over the old metric of achieving a 60 on the BIG-Bench Lite to another test. I propose resolving this N/A and remaking this with the Massive Multitask Language Understanding.

People are also trading

🐕 Will A.I. Achieve Significantly More, "Linguistic Temporal Understanding" by end of 2024?

🐕 Will AI Achieve Significantly More, "Embodiment" by end of 2024?

🐕 Will A.I. Be Able to Make Significantly Better, "Common Sense Judgements About What Happens Next," by End of 2024?

🐕 Will A.I. Get Significantly Better at Evaluating Scientific Claims by the end of 2024? (As Measured By Leaderboard)

🐕 Will AI Be Able to Gain a Much Broader Academic and Professional Understanding by the End of 2024?

🐕 Will A.I. Be Significantly Better at, "Egocentric Navigation," by the End of 2024?

🐕 Will A.I. Be Able to Meet Just Below Human Performance In Being Able to "Track Changes in State," By the End of 2024?

🐕 Will AI Be Able to Understand the, "Meaning" of Questions Significantly Better By the End of 2024?

🐕 Will Any AI Effectively Achieve Higher than Human Level Reasoning Through Common Sense Questions, By 2024 End?

🐕 Will A.I. Become Significantly Better at Drug Discovery in 2024?

Related questions

🐕 Will A.I. Achieve Significantly More, "Linguistic Temporal Understanding" by end of 2024?

🐕 Will AI Achieve Significantly More, "Embodiment" by end of 2024?

🐕 Will A.I. Be Able to Make Significantly Better, "Common Sense Judgements About What Happens Next," by End of 2024?

🐕 Will A.I. Get Significantly Better at Evaluating Scientific Claims by the end of 2024? (As Measured By Leaderboard)

🐕 Will AI Be Able to Gain a Much Broader Academic and Professional Understanding by the End of 2024?

🐕 Will A.I. Be Significantly Better at, "Egocentric Navigation," by the End of 2024?

🐕 Will A.I. Be Able to Meet Just Below Human Performance In Being Able to "Track Changes in State," By the End of 2024?

🐕 Will AI Be Able to Understand the, "Meaning" of Questions Significantly Better By the End of 2024?

🐕 Will Any AI Effectively Achieve Higher than Human Level Reasoning Through Common Sense Questions, By 2024 End?

🐕 Will A.I. Become Significantly Better at Drug Discovery in 2024?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules