Hellfire103@lemmy.ca to Not The Onion@lemmy.worldEnglish · 1 年前OpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Uswww.404media.coexternal-linkmessage-square122fedilinkarrow-up11.16K
arrow-up11.16Kexternal-linkOpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Uswww.404media.coHellfire103@lemmy.ca to Not The Onion@lemmy.worldEnglish · 1 年前message-square122fedilink
minus-squareAvid Amoeba@lemmy.calinkfedilinkEnglisharrow-up23·1 年前Is there evidence that DeepSeek is an OpenAI distillate other than OpenAI and Co’s protestations?
minus-squarebrucethemoose@lemmy.worldlinkfedilinkEnglisharrow-up36·edit-21 年前It’s literally impossible. I tried to explain it here: https://lemmy.world/comment/14763233 But the short version is OpenAI doesn’t even offer access to the data you need for a “distillation,” as the term is used in the LLM community. Of course there’s some OpenAI data in the base model, but that’s partially because it’s splattered all over the internet now.
minus-squaremorrowind@lemmy.mllinkfedilinkEnglisharrow-up6·1 年前Not distillate, they just trained on the outputs of openai
Is there evidence that DeepSeek is an OpenAI distillate other than OpenAI and Co’s protestations?
It’s literally impossible. I tried to explain it here: https://lemmy.world/comment/14763233
But the short version is OpenAI doesn’t even offer access to the data you need for a “distillation,” as the term is used in the LLM community.
Of course there’s some OpenAI data in the base model, but that’s partially because it’s splattered all over the internet now.
Thank you 🙏
Not distillate, they just trained on the outputs of openai