Running local LLMs? What's your model and hardware

Question

roscas · Accepted Answer

qwen3-coder:30bcodestral:22bcodegemma:7bcodellama:34bnorth-mini-code-1.0:q8_0laguna-xs.2:latestCurrently testing those above on AMD Ryzen 5 3600x with 48GB of RAM and a nVidia 3080 with 10GB of VRAM.Favorite model is laguna-xs.2 because it is really fast on CPU and very good.

cyanydeez · Answer

qwen 3.6 35B on 128GB strix halo.perfect speed to not melt the brain and can extend context for well scoped projects.need to work with dynamic context pruning to ensure full reuse in larger projects.deer-flow seems. to work well for project scoping and high level evals. opencode for coding.

Running local LLMs? What's your model and hardware

Running local LLMs? What's your model and hardware

qwen3-coder:30b
codestral:22b
codegemma:7b
codellama:34b
north-mini-code-1.0:q8_0
laguna-xs.2:latest
Currently testing those above on AMD Ryzen 5 3600x with 48GB of RAM and a nVidia 3080 with 10GB of VRAM.
Favorite model is laguna-xs.2 because it is really fast on CPU and very good.

qwen 3.6 35B on 128GB strix halo.
perfect speed to not melt the brain and can extend context for well scoped projects.
need to work with dynamic context pruning to ensure full reuse in larger projects.
deer-flow seems. to work well for project scoping and high level evals. opencode for coding.