@L_Acacia

L_Acacia · 2 years ago

Is there a way to download content from the community workshop using the steam download_depot ?

L_Acacia · 2 years ago

Are you on windows or linux, if you managed to fond the dlc files, you can most likely(not 100% sure it works with delisted) use creamAPI to make steam think you on them. On windows I’ve used CreamInstaller, which is a handy GUI that does it for you. I seem to recall I did it on my father computer running ubuntu, but don’t recall exactly how.

L_Acacia · edit-2 2 years ago

Scrubbles’s comment outlined what would likely be the best workflow. Having done something similar myself, here are my recommendations:

In my opinion, the best way to do STT with Whisper is by using Whisper Writer, I use it to write most most messages and texts.

For the LLM part, I recommend Koboldcpp. It’s built on top of llama.cpp and has a simple GUI that saves you from looking for the name of each poorly documented llama.cpp launch flag (cli is still available if you prefer). Plus, it offers more sampling options.

If you want a chat frontend for the text generated by the LLM, SillyTavern is a great choice. Despite its poor naming and branding, it’s the most feature-rich and extensible frontend. They even have an official extension to integrate TTS.

For the TTS backend, I recommend Alltalk_tts. It provides multiple model options (xttsv2, coqui, T5, …) and has an okay UI if you need it. It also offers a unified API to use with the different models. If you pick SillyTavern, it can be accessed by their TTS extension. For the models, T5 will give you the best quality but is more resource-hungry. Xtts and coqui will give you decent results and are easier to run.

There are also STS models emerging, like GLM4-V, but I still haven’t tried them, so I can’t judge the quality.

L_Acacia · 2 years ago

zen integrates every upstream change a few hours after release, it is built as a set of patch on top of firefox just to make that easy

L_Acacia · 2 years ago

they released a search engine where the model reads the first link before trying to answer your request

L_Acacia · 2 years ago

llama.cpp works on windows too (or any os for that matter), though linux will vive you better performances

L_Acacia · 2 years ago

Revolt tries to be a discord clone/replacement and suffer from some of the same issues. Matrix happens to have a lot of feature in common, but is focused on privacy and security at its core.

L_Acacia · 2 years ago

Mistral modèles don’t have much filter don’t worry lmao

L_Acacia · 2 years ago

They is no chance they are the one training it. It costs hundreds of millions to get a descent model. Seems like they will be using mistral, who have scrapped pretty much 100% of the web to use as training data.

L_Acacia · 2 years ago

Buying second hand 3090/7090xtx will be cheaper for better performances if you are not building the rest of the machine.

L_Acacia · 2 years ago

You are limited by bandwidth not compute with llm, so accelerator won’t change the interferance tp/s

L_Acacia · 2 years ago

I use similar feature on discord quite extensively (custom emote/sticker) and i don’t feel they are just a novelty. Allows us to have inside joke / custom reaction to specific event and I really miss it when trying out open source alternatives.

L_Acacia · 2 years ago

Too be fair to Gemini, even though it is worse than Claude and Gpt. The weird answer were caused by bad engineering and not by bad model training. They were forcing the incorporattion off the Google search results even though the base model would most likely have gotten it right.

L_Acacia · 2 years ago

The training doesn’t use csam, 0% chance big tech would use that in their dataset. The models are somewhat able to link concept like red and car, even if it had never seen a red car before.

L_Acacia · edit-2 2 years ago

The models used are not trained on CP. The models weight are distributed freely and anybody can train a LORA on his computer. Its already too late to ban open weight models.

L_Acacia · 2 years ago

Google uses their own chip for AI

L_Acacia · 2 years ago

They know the tech is not good enough, they just dont care and want to maximise profit.

L_Acacia · 2 years ago

Whatsapp is europe’s iMessage

L_Acacia · 2 years ago

You can take a look at exllama and llama.cpp source code on github if you want to see how it is implemented.

L_Acacia · 2 years ago

If you have good enough hardware, this is a rabbithole you could explore. https://github.com/oobabooga/text-generation-webui/