
imil.net
June 13, 2026
Summary
A dual-GPU setup pairing an RTX 5080 with an RTX 3090 achieves over 80 tokens per second on the Qwen 3.6 27B Q8 model. The RTX 3090, with its 24 GB of memory, significantly improves performance: generation starts at about 30 tokens per second and rises to 50-60 tokens per second with MTP (multi-token prediction).
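To see why the extra 24 GB card matters, here is a rough back-of-the-envelope estimate of the weight footprint. It is a sketch under stated assumptions, not a measurement from the article: it assumes "Q8" means a GGUF `Q8_0`-style format at roughly 8.5 bits per weight, a nominal 27 billion parameters, and 16 GB on the RTX 5080, and it ignores KV cache and activation memory.

```python
# Rough VRAM estimate. Every constant here is an assumption for illustration,
# not a figure taken from the article.
PARAMS = 27e9           # nominal parameter count of a "27B" model
BITS_PER_WEIGHT = 8.5   # approx. for a Q8_0-style format (int8 values + per-block scale)

# Nominal card capacities, treated as GiB for simplicity.
GPUS_GIB = {"RTX 5080": 16, "RTX 3090": 24}


def weight_gib(params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GiB."""
    return params * bits_per_weight / 8 / 2**30


weights = weight_gib(PARAMS, BITS_PER_WEIGHT)
total = sum(GPUS_GIB.values())

# Does the model fit on either card alone, or only on both together?
fits_single = {name: weights < cap for name, cap in GPUS_GIB.items()}
fits_combined = weights < total

print(f"weights: {weights:.1f} GiB, combined VRAM: {total} GiB")
print(f"fits on one card: {fits_single}, fits across both: {fits_combined}")
```

Under these assumptions the weights alone exceed either card's capacity but fit across both, which is consistent with the second GPU's memory driving the speedup; the real headroom is smaller once context and runtime overhead are counted.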