Hello World

SkySyrup@sh.itjust.works · 3 years ago

Hello World

scrollbars@lemmy.ml · 3 years ago

Hello! This is the one community that I was a bit worried about finding an equivalent of outside of reddit. Hopefully more of us migrate over.

dirac_field · 3 years ago

Late to the party, but thanks for setting this up! I suspect the overlap of people both using local LLMs and hungry for reddit alternatives will be higher than average

mellery · 3 years ago

Hello! Thanks for setting this up

hendrik@lemmy.ml · 3 years ago

thank you for using a decent platform. i doubt more than 20 people will migrate from reddit… but it make the world a better place, anyways.

Barbarian@sh.itjust.works · edit-2 3 years ago

You should make a post in !findacommunity@lemmy.ml

pax@sh.itjust.works · 3 years ago

I could help with moderation, but I have a question, how to set up LLAma on my mac computer? any tips?

SkySyrup@sh.itjust.works · edit-2 3 years ago

Hi, sure, thank you so much for helping out! As for LLaMA, I would point you at llama.cpp, (https://github.com/ggerganov/llama.cpp) which is the absolute bleeding edge, but also has pretty useful instructions on the page (https://github.com/ggerganov/llama.cpp#usage). You could also use Kobold.cpp, but I don’t have any experience with it, so I can’t help you if you have issues.

gh0stcassette@lemmy.world · 3 years ago

Adding to this: text-generation-webui (https://github.com/oobabooga/text-generation-webui) works with the latest bleeding edge llama.cpp via llama-cpp-python, and it has a nice graphical front-end. You do have a manually tell pip to install llama.cpp-python with the right compiler flags to get GPU acceleration working but the llama-cpp-python github and ooba github explain how to do this.

You can even set up GPU acceleration through metal on m1 Macs I’ve seen some fucking INSANE performance numbers online for the higher RAM MacBook pros (20+ tokens/sec, I think with a 33b model, but it might have been 13b, either way, impressive.)

pax@sh.itjust.works · 3 years ago

llama cpp is crashy on my computer, it even didn’t compiled.

SkySyrup@sh.itjust.works · 3 years ago

Huh, that’s interesting. If llama.cpp doesn’t work, try https://github.com/oobabooga/text-generation-webui which (tries to) provides a user-friendly(-ier) experience.

pax@sh.itjust.works · 3 years ago

it launches just fine, but when loading a model it says something like: successfully loaded none

SkySyrup@sh.itjust.works · 3 years ago

Have you put your model in the “models” folder in the “text-generation-webui” folder? If you have, then navigate over to the “Model” section (button for the menu should be at the top of the page) and select your model using the box below the menu.

pax@sh.itjust.works · 3 years ago

I tried to download an example one, cus I don’t have any model, failed.

SkySyrup@sh.itjust.works · 3 years ago

I’d recommend the model Wizard-Vicuna-7b-Uncensored (i know it’s like a sentence https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GGML) direct download link is here: https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GGML/blob/main/Wizard-Vicuna-7B-Uncensored.ggmlv3.q5_1.bin

dtlnx@beehaw.org · 3 years ago

Try this. It works great for me.

https://gpt4all.io/index.html

pax@sh.itjust.works · 3 years ago

gpt4all is dump, it even didn’t tried to be smart.