Meanwhile on DeepSeek:
Perfectly displaying my local time in real time, indistinguishable from an actual clock. China stays winning.
The Qwen model is also Chinese
Lol. Wonder what went wrong there.
Easy. It’s the same people behind both LLMs. They just told a developer to code a program that makes an app that designs an algorithm that makes a neural network that puts all the bad stuff into one LLM and all the good stuff into another, and that’s how you get Qwen.
Not enough good training data
IIRC Qwen 2.5 is old and a much smaller model than DeepSeek 3.1, and a lot of the models are outdated anyway. Qwen does make models that are more comparable to DeepSeek.
A temperature greater than zero