Did OpenAI use MUP for zero shot hyper-parameter transfer in GPT-4? | Manifold

Did OpenAI use MUP for zero shot hyper-parameter transfer in GPT-4?

5

110Ṁ169

Dec 31

81%

chance

1D

1W

1M

ALL

Maximal Update Parameterization is technique published last year by Yang et al. at Microsoft. https://arxiv.org/abs/2203.03466

— LLM & AI Capabilities—

Get

1,000

to start trading!

Sort by:

predictedYES

@firstuserhere interesting that it is in the bibliography, although the reference in the first image is from a different section of the report with its own bibliography (that [16] actually refers to "DALL·E 2 Preview - Risks and Limitations.").

So the muP paper is in the bibliography, but not referenced anywhere.

@Stefan yep, and even then it's not actually used in gpt-4, the report only mentions the red team to have used the paper?

People are also trading

Will OpenAI release o3 >1 week before GPT-5?

Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?

+8% 1d75% chance

Will OpenAI's autonomous agent be based on GPT-4?

Did OpenAI transcribe Youtube videos to train a GPT model as claimed by NYT?

Will OpenAI change their naming scheme (GPT-X) with the successor to GPT-4? (Ṁ200 subsidy!)

Will OpenAI release true multimodal image generation for GPT-4.5 before 2026?

Will the next LLM released by OpenAI be worse than GPT-4 at MMLU?

Will OpenAI open source the weights to one of the GPT family models in 2024?

Will there be evidence in 2025 that in April 2023, OpenAI had a GPT-4.5 or higher model?

Will OpenAI abandon discrete GPT releases in favor of continuous updates?

Related questions

Will OpenAI release o3 >1 week before GPT-5?

Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?

Will OpenAI's autonomous agent be based on GPT-4?

Did OpenAI transcribe Youtube videos to train a GPT model as claimed by NYT?

Will OpenAI change their naming scheme (GPT-X) with the successor to GPT-4? (Ṁ200 subsidy!)

Will OpenAI release true multimodal image generation for GPT-4.5 before 2026?

Will the next LLM released by OpenAI be worse than GPT-4 at MMLU?

Will OpenAI open source the weights to one of the GPT family models in 2024?

Will there be evidence in 2025 that in April 2023, OpenAI had a GPT-4.5 or higher model?

Will OpenAI abandon discrete GPT releases in favor of continuous updates?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules