
github.com
May 15, 2026
2 min read
52/100
Summary
Official implementation and model checkpoints for Orthrus, a dual-architecture framework that unifies the exact generation fidelity of autoregressive Large Language Models (LLMs) with the high-speed parallel token generation of diffusion models. demo_orthrus.mp4 All models use a Qwen3 backbone and guarantee strictly lossless generation. | Model | Base Model | HuggingFace | Avg. Speedup | |---|---|...