
lmsys.org
April 25, 2026
17 min read
48/100
Summary
DeepSeek-V4 is now supported for both inference and reinforcement learning (RL) training from Day 0. SGLang and Miles provide the first open-source stack designed for DeepSeek-V4’s hybrid sparse-attention architecture and manifold-constrained hyper-connections, utilizing FP4 expert weights.
Key Takeaways

Bringing Up DeepSeek-V4-Flash on AMD MI300X
Jun 2, 2026

DeepSeek V4–almost on the frontier, a fraction of the price
May 1, 2026

DeepSeek 4 Flash local inference engine for Metal
May 7, 2026

LLM Neuroanatomy II: Modern LLM Hacking and Hints of a Universal Language?
Mar 24, 2026

Step 3.5 Flash – Open-source foundation model, supports deep reasoning at speed
Feb 19, 2026