cm0002@piefed.world to LocalLLaMA@sh.itjust.works · English · 2 months ago
ollama 0.11.9 Introducing A Nice CPU/GPU Performance Optimization (www.phoronix.com)
hendrik@palaver.p3x.de · 2 months ago
I think llama.cpp merged ROCm support in 2023 already. It's called HIP in their README, but I'm not super educated on all the acronyms, compute frameworks, and instruction sets.
afk_strats@lemmy.world · 2 months ago
ROCm is a software stack which includes a bunch of SDKs and APIs. HIP is a subset of ROCm which lets you program AMD GPUs with a focus on portability from Nvidia's CUDA.
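To illustrate the portability point above: a minimal HIP vector-add sketch, assuming the ROCm/HIP toolchain (hipcc) and an AMD GPU are available. The API deliberately mirrors CUDA's, so hipMalloc corresponds to cudaMalloc, hipMemcpy to cudaMemcpy, and kernels use the same __global__ and <<<grid, block>>> launch syntax.

```cpp
// Illustrative HIP sketch; compile with hipcc, requires an AMD GPU + ROCm.
#include <hip/hip_runtime.h>
#include <cstdio>

// Same kernel syntax as CUDA.
__global__ void vec_add(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main() {
    const int n = 256;
    float ha[n], hb[n], hc[n];
    for (int i = 0; i < n; ++i) { ha[i] = float(i); hb[i] = 2.0f * i; }

    // hipMalloc / hipMemcpy are one-for-one renames of their CUDA counterparts.
    float *da, *db, *dc;
    hipMalloc(&da, n * sizeof(float));
    hipMalloc(&db, n * sizeof(float));
    hipMalloc(&dc, n * sizeof(float));
    hipMemcpy(da, ha, n * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(db, hb, n * sizeof(float), hipMemcpyHostToDevice);

    vec_add<<<1, 256>>>(da, db, dc, n);

    hipMemcpy(hc, dc, n * sizeof(float), hipMemcpyDeviceToHost);
    printf("hc[10] = %f\n", hc[10]);

    hipFree(da); hipFree(db); hipFree(dc);
    return 0;
}
```

Because the mapping is mostly mechanical renaming, ROCm also ships hipify tools that translate existing CUDA sources to HIP, which is how projects like llama.cpp can target AMD GPUs without maintaining a fully separate backend.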