Best local LLM performance for RX 7900 XTX

Compare RX 7900 XTX benchmark results and see which models earn the highest quality scores among completed benchmarks.

Hardware Overview

GPU-specific specifications for local LLM planning.

AMD RX 7900 XTX
VRAM: 24 GB
Architecture: RDNA 3
Memory bandwidth: 960.0 GB/s
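
For planning purposes, the two numbers above (24 GB of VRAM and 960.0 GB/s of memory bandwidth) can be turned into rough limits. The following is a minimal sketch, assuming the common rule of thumb that token generation is memory-bandwidth bound, so the speed ceiling is roughly bandwidth divided by the bytes read per generated token. The function names, the 12 GB weight size, and the 4 GB overhead figure are hypothetical values chosen for illustration, not measurements from this page.

```python
# Spec values taken from the hardware overview above.
RX_7900_XTX_VRAM_GB = 24.0
RX_7900_XTX_BANDWIDTH_GB_S = 960.0


def decode_speed_ceiling(bandwidth_gb_s: float, gb_read_per_token: float) -> float:
    """Rough upper bound on tokens/s if decoding is purely bandwidth bound.

    gb_read_per_token is the amount of weight data streamed from VRAM for
    each generated token (for a dense model, roughly the full weight size).
    """
    if gb_read_per_token <= 0:
        raise ValueError("gb_read_per_token must be positive")
    return bandwidth_gb_s / gb_read_per_token


def fits_in_vram(weights_gb: float, overhead_gb: float,
                 vram_gb: float = RX_7900_XTX_VRAM_GB) -> bool:
    """Check whether weights plus cache/runtime overhead fit in VRAM."""
    return weights_gb + overhead_gb <= vram_gb


if __name__ == "__main__":
    # Hypothetical model: 12 GB of quantized weights, 4 GB of overhead.
    print(decode_speed_ceiling(RX_7900_XTX_BANDWIDTH_GB_S, 12.0))
    print(fits_in_vram(12.0, 4.0))
```

Measured speeds normally land below this ceiling, and models that read only part of their weights per token can exceed what the full weight size would suggest, so treat the result as an order-of-magnitude guide only.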

Recommended Models

Models we recommend for the Radeon RX 7900 XTX.

Fastest benchmarked model

openai/gpt-oss-20b

Parameter count: Unknown

111.1 tok/s

This benchmark delivered the highest token generation speed at 111.1 tok/s.

Best-quality benchmarked model

qwen3.6:latest

36.0B

6.1 tok/s

Quality 82.5

This benchmark has the highest quality score among completed benchmarks, at 82.5.

Largest context that still fits in VRAM

openai/gpt-oss-20b

20B

108.3 tok/s

Context 256,000 tokens

This benchmark fits within 24 GB of VRAM while offering a 256,000-token context window.
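
The three recommendations above each amount to taking a maximum over a different field of the benchmark records. The sketch below shows that selection using only the three results listed in this section. The `Result` class and the helper names are hypothetical, fields the page does not report are left as `None`, and whether a run fits in VRAM is taken as given rather than computed.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Result:
    model: str
    tok_per_s: float
    quality: Optional[float] = None        # None when no score is listed
    context_tokens: Optional[int] = None   # None when no context is listed


# The three benchmark rows shown in the recommendations above.
RESULTS = [
    Result("openai/gpt-oss-20b", 111.1),
    Result("qwen3.6:latest", 6.1, quality=82.5),
    Result("openai/gpt-oss-20b", 108.3, context_tokens=256_000),
]


def fastest(results: List[Result]) -> Result:
    """Row with the highest token generation speed."""
    return max(results, key=lambda r: r.tok_per_s)


def best_quality(results: List[Result]) -> Optional[Result]:
    """Row with the highest quality score, ignoring unscored rows."""
    scored = [r for r in results if r.quality is not None]
    return max(scored, key=lambda r: r.quality) if scored else None


def largest_context(results: List[Result]) -> Optional[Result]:
    """Row with the largest listed context window."""
    known = [r for r in results if r.context_tokens is not None]
    return max(known, key=lambda r: r.context_tokens) if known else None


if __name__ == "__main__":
    print(fastest(RESULTS).model)
    print(best_quality(RESULTS).model)
    print(largest_context(RESULTS).context_tokens)
```

Rows without a quality score or context figure are skipped rather than treated as zero, so a missing value never wins or loses a ranking by accident.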

Benchmark Results

Completed quality-ranked benchmark results for AMD RX 7900 XTX.