Guide
Best Local LLMs for Coding
Pick a local coding model based on quality, speed, and whether it fits your GPU without constant swapping.
- Prioritize code quality and instruction following before raw token speed.
- The ~20B parameter class, led by gpt-oss:20b, dominates the quality charts while retaining lightning-fast speeds.
- Use the benchmark pages to compare the same model across different hardware and tools.