Fastest benchmarked model
gpt-oss:20b
20.9B
73.3 tok/s
gpt-oss:20b delivered the highest token generation speed of any benchmarked model, at 73.3 tok/s.
Best local LLM performance for RX 7900 XTX
Compare RX 7900 XTX benchmark results and see which models earn the highest quality scores among completed runs.
GPU-specific specifications for local LLM planning.
Models we recommend for the Radeon RX 7900 XTX.
Best-quality benchmarked model
36.0B
This benchmark has the highest completed quality score at 82.5.
Largest context that still fits in VRAM
35B
This benchmark fits within 24 GB of VRAM while offering a 16,384-token context window.
Top completed quality scores for AMD RX 7900 XTX, capped at 10 results.