You probably shouldn't trust the info anyway.

The Picard Maneuver@lemmy.world · 1 year ago

You probably shouldn't trust the info anyway.

BaroqueInMind · edit-2 1 year ago

Hermes3 is better in every way.

If anyone is reading this, your fucking gaming PC can run a 8B model of Hermes, and with the correct initial system prompt will be as smart as ChatGPT4o.

Here’s how to do it.

Possibly linux@lemmy.zip · edit-2 1 year ago

https://ollama.com/library/hermes3

I personally don’t use it as it isn’t under an open license.

BaroqueInMind · edit-2 1 year ago

What are you talking about? It follows the Llama 3 Meta license which is pretty fucking open, and essentially every LLM that isn’t a dogshit copyright-stealing Alibaba Quen model uses it.

Edit: Mistral has an almost similar license that Meta released Llama 3 with.

Both Llama 3 and Mistral AI’s non-production licenses restrict commercial use and emphasize ethical responsibility, Llama 3’s license has more explicit prohibitions and control over specific applications. Mistral’s non-production license focuses more on research and testing, with fewer detailed restrictions on ethical matters. Both licenses, however, require separate agreements for commercial usage.

Tl:Dr Mistral doesn’t give two fucks about ethics and needs money more than Meta

Possibly linux@lemmy.zip · 1 year ago

Mistral is licensed under the Apache license version 2.0. This license is recognized under the GNU project and under the Open source initiative. This is because it protects your freedom.

Meanwhile the Meta license places restrictions on use and arbitrary requirements. It is those requirements that lead me to choose not to use it. The issue with LLM licensing is still open but I certainly do not want a EULA style license with rules and restrictions.

BaroqueInMind · 1 year ago

You are correct. I checked HuggingFace just now and see they are all released under Apache license. Thank you for the correction.

bruhduh@lemmy.world · edit-2 1 year ago

Is hermes 8b is better than mixtral 8x7b?

BaroqueInMind · edit-2 1 year ago

Hermes3 is based on the latest Llama3.1, Mixtral 8x7B is based on Llama 2 released a while ago. Take a guess which one is better. Read the technical paper, it’s only 12 fucking pages.

MystikIncarnate@lemmy.ca · 1 year ago

Okay, but fucking pages sounds like a good way to get papercuts in places I don’t want papercuts.