SmokeyDope@lemmy.worldM to LocalLLaMA@sh.itjust.worksEnglish · edit-27 days agoDeepSeek just released updated r1 models with 'deeper and more complex reasoning patterns'. Includes a r1 distilled qwen3 8b model boasting "10% improved performance" over originalhuggingface.coexternal-linkmessage-square9fedilinkarrow-up127file-text
arrow-up127external-linkDeepSeek just released updated r1 models with 'deeper and more complex reasoning patterns'. Includes a r1 distilled qwen3 8b model boasting "10% improved performance" over originalhuggingface.coSmokeyDope@lemmy.worldM to LocalLLaMA@sh.itjust.worksEnglish · edit-27 days agomessage-square9fedilinkfile-text
minus-squareBaroqueInMindlinkfedilinkEnglisharrow-up1·edit-26 days agoI can’t find any abliterated models of this new release that aren’t quantized to shit and are GGUF to work with my Ollama instance
I can’t find any abliterated models of this new release that aren’t quantized to shit and are GGUF to work with my Ollama instance