Best local LLM performance for RX 7900 XTX

Best Local LLM Performance for RX 7900 XTX

Compare RX 7900 XTX benchmark results and see which models actually earn the best completed quality scores.

Hardware Overview

GPU-specific specifications for local LLM planning.

AMD RX 7900 XTX
VRAM
24GB
Architecture
RDNA 3
MEMORY BANDWIDTH
960.0 GB/s

Recommended Models

Models we recommend for the Radeon RX 7900 XTX.

Fastest benchmarked model

gpt-oss:20b

20.9B

73.3 tok/s

This benchmark delivered the highest token generation speed at 73.3 tok/s.

Best-quality benchmarked model

qwen3.6:latest

36.0B

6.1 tok/sQuality 82.5

This benchmark has the highest completed quality score at 82.5.

Largest context that still fits in VRAM

qwen/qwen3.6-35b-a3b

35B

32.4 tok/sContext 16,384 tokens

This benchmark fits within 24GB VRAM while offering a 16,384 token context window.

Benchmark Results

Top completed quality scores for AMD RX 7900 XTX, capped at 10 results.