Will a Mamba 7b model trained on 2 trillion tokens outperform Llama2-13B
Standard
21
Ṁ738Jul 1
66%
chance
1D
1W
1M
ALL
Question will resolve positive if someone trains a Mamba (https://twitter.com/tri_dao/status/1731728602230890895) language model with <=7.5billion parameters on <=2 trillion tokens that outperforms Llama2-13B on the huggingface open llm leaderboard (https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
Get
1,000
and1.00
Related questions
Related questions
What ELO will LLAMA 3.2 model get on LMSys?
Will the next major LLM by OpenAI use a new tokenizer?
76% chance
Will a 15 billion parameter LLM match or outperform GPT4 in 2024?
24% chance
Will the Jan 2024 version of the LLM detector "Binoculars" be effective against OpenAI's best model at end 2024?
59% chance
Will Llama 3-multimodal be natively mixed-multimodal? (VQ-VAE+next token prediction)
50% chance
Will a open source pure Mamaba LLM surpass 82 MMLU on MMLU (5-shot) before end of year 2024?
25% chance
Will any open-source model achieve GPT-4 level performance on MMLU through 2024?
83% chance
Will a single model running on a single consumer GPU (<1.5k 2020 USD) outperform GPT-3 175B on all benchmarks in the original paper by 2025?
85% chance
How many active parameters will the largest Llama 3 have?
77% chance
Will Llama 4 be the best LLM in the chatbot arena?
16% chance