Technically, this has already been done (through clear data contamination). Should I assume this only resolves yes if there is no evidence of data contamination? See "Catch me if you can! How to beat GPT-4 with a 13B model" | LMSYS Org
@DanielKokotajlo
I’ll count a model as open source if the model weights are accessible by people outside the organization.
Llama was originally released for researchers, and I would count this as open source for the purposes of this question.
If hackers put the weights on a torrent, that’s open source too.
I realize this deviates from the definition of open source used in OSS communities. The spirit of the question is focused on malicious use and proliferation potential.
@mattt OK, thanks for the clarification. In that case, I think this question is pretty much equivalent to mine: "GPT4 or better model available for download by EOY 2024?" | Manifold