Ask HN: What does your local LLM setup looks like? What models and hardware you are using? For what purpose? What were the challenges? Any tricks that helped you in doing this? This might help new users like me setup theirs. |
Ask HN: What does your local LLM setup looks like? What models and hardware you are using? For what purpose? What were the challenges? Any tricks that helped you in doing this? This might help new users like me setup theirs. |
If I can go back in time, I would probably buy a more AI dedicated machine but I also don't regret finally being able to play Cyberpunk in 4k with great FPS and overkill mods.
I've mostly enjoyed having WSL to leverage Linux dev tools, but it seems like it's still adding overhead that prevents me from taking advantage of the GPU in full, so I'll likely get another drive and install Linux.
I tried Qwen, Llama, Mistral and Gemma. Gemma 4 was pretty impressive.
Runs pretty well with Ollama on the Qwen models. It seems like Qwen has done a great job with speed.