ChatGPT can get worse over time, Stanford study finds | Fortune

@cyu@sh.itjust.works · 1 year ago

ChatGPT can get worse over time, Stanford study finds | Fortune

Southern Wolf · edit-2 1 year ago

This has already been disproven, due to the fact the method the researchers used to test how well it was doing was flawed to begin with. Here is a pretty good twitter-thread showing why the methods they used were flawed: https://twitter.com/svpino/status/1682051132212781056

TL:DR: They used an approach of only giving it prime numbers, and asking it if they were prime numbers. They didn’t intersperse prime and non-prime numbers to really test it’s capabilities at determining that. Turns out that if you do that, both the early and current versions of GPT4 are equally bad at determining prime numbers, with effectively no change noted between the versions.