Testing the Limits: My GTX 1070 Rig vs Mistral Small 22B

SmokeyDope@lemmy.world · 1 year ago

Testing the Limits: My GTX 1070 Rig vs Mistral Small 22B

BaroqueInMind · 1 year ago

Read up on Hermes3 technical paper and you’ll realize it’s the best one. Running 8B model with the correct initial system prompt makes it as smart as GPT4o

SmokeyDope@lemmy.world · 1 year ago

The linked paper was a good read. Thank you.

BaroqueInMind · edit-2 1 year ago

Ironically, if you ask ChatGPT to write you an initial system prompt for Hermes that will sound similar to its own, it will essentially share a trade secret with you and give up portions of its system prompt to make your 8B self hosted LLM perform like a commercial one.