
lmsys.org
April 25, 2026
Summary
SGLang and Miles support DeepSeek-V4 for both inference and reinforcement learning (RL) training from Day 0. Together they provide the first open-source stack designed for the model's hybrid sparse-attention architecture, manifold-constrained hyper-connections, and FP4 expert weights.