Mira: So why is “gpt-3.5-turbo-instruct” so much better than GPT-4 at chess? Probably because 6 months ago, someone checked in a chess eval in OpenAI’s evals repo.
I think this is silly. The eval tests it on 101 chess positions. That’s fine-tuning, not training.