I’ve done some testing with the two large models and my initial impression is that they seem very similar in quality to Qwen3.5 35B and 27B. Some notable exceptions:
llama.cpp has speculative decoding support on day 1, and it noticeably speeds up generation (see the sketch after this list).
A day-1 base model release will undoubtedly lead to faster finetunes.
Can’t wait for the inevitable Claude/Gemini distils.
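For anyone who wants to try the speculative decoding path: here's a minimal sketch of querying llama-server's OpenAI-compatible endpoint once it's been launched with a draft model. The launch flags in the comment are from recent llama.cpp builds and change between versions, so treat them as an assumption and check `llama-server --help`; the model filenames are placeholders.

```python
# Assumes llama-server is already running with a separate draft model, e.g.:
#   llama-server -m big-model.gguf -md small-draft.gguf --draft-max 16
# (flag names vary between llama.cpp versions; check llama-server --help)
import json
import urllib.request

# llama-server exposes an OpenAI-compatible API on port 8080 by default.
req = urllib.request.Request(
    "http://127.0.0.1:8080/v1/chat/completions",
    data=json.dumps({
        "messages": [{"role": "user", "content": "Say hi in one word."}],
    }).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.load(resp)["choices"][0]["message"]["content"])
```

Speculative decoding is transparent to the client: the draft model proposes tokens and the big model verifies them server-side, so the request looks identical with or without it.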
My verdict is that even though these models benchmark slightly lower than their Qwen equivalents, their performance and ecosystem support will likely drive me to pick them.
What hardware are you running them on? I'm interested in self-hosting an LLM myself, but I'm not sure what hardware I need. And how do you think these self-hosted variants compare to, for example, Claude Sonnet 4.6?
Ok, I really really appreciate the depth you’ve put into your answers.
I always look at these grading rubrics people post for models and I’ve never seen an example of how they get ranked.
At this point I don’t think I’ll be ranking models myself, I’m not an enthusiast (yet) just running some ~30B models at home for various things and trying to stay afloat in what is a significantly more complicated ecosystem than I had imagined when I started.
But I really appreciate what you’ve written and I’m going to save all this.
Last questions - I see that you used Claude to come up with your test questions, right? How do you even validate the anchor answers if you’re not an expert in the field?
[deleted by user]
[deleted by user]
Very interesting! Thank you for your detailed answer! :)
[deleted by user]
Rookie question, forgive me:
How are the scores generated? How do you get 7/8.5 on a complicated ethical question? How are these scales even defined?
[deleted by user]
Do you do this professionally?
Are there open Claude models? I thought they were proprietary only.
Not exactly. Some people have used Claude’s thinking patterns to train other open models.
This one has been in the top 5 on Hugging Face for weeks: https://huggingface.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled
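For anyone wondering what "distilling" means in practice here, a minimal sketch of the usual recipe: collect (prompt, reasoning, answer) triples from the teacher and write them out as chat-format SFT examples for the student. The field names, the `<think>` tag convention, and the sample content below are illustrative assumptions, not taken from that repo.

```python
# Hypothetical sketch of building a reasoning-distill SFT dataset.
# Field names and the <think> tag format are assumptions for illustration.
import json

teacher_samples = [
    {
        "prompt": "Why does ice float on water?",
        "reasoning": "Water expands as it freezes, so ice is less dense...",
        "answer": "Ice is less dense than liquid water, so it floats.",
    },
]

with open("distill_sft.jsonl", "w") as f:
    for s in teacher_samples:
        record = {
            "messages": [
                {"role": "user", "content": s["prompt"]},
                # Keep the teacher's reasoning trace in the target so the
                # student learns the thinking pattern, not just the answer.
                {"role": "assistant",
                 "content": f"<think>{s['reasoning']}</think>\n{s['answer']}"},
            ]
        }
        f.write(json.dumps(record) + "\n")
```

The resulting JSONL can then be fed to a standard finetuning pipeline; the "distill" part is just that the assistant turns come from the closed model's outputs rather than human annotators.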