
fergusfinn.com
June 2, 2026
8 min read
52/100
Summary
DeepSeek-V4-Flash is being implemented on the AMD MI300X, which launched in December 2023 as AMD's competitor to NVIDIA's H100 and H200 AI accelerators. The MI300X aims to address the current compute shortage while building an inference cloud for high-volume AI tasks.
Key Takeaways

DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles
Apr 25, 2026

A 10 year old Xeon is all you need
Jun 1, 2026

DeepSeek V4–almost on the frontier, a fraction of the price
May 1, 2026

We got 207 tok/s with Qwen3.5-27B on an RTX 3090
Apr 20, 2026
Flash-MoE: Running a 397B Parameter Model on a Laptop
Mar 22, 2026