
github.com
May 5, 2026
Summary
The GitHub repository angelos-p/llm-from-scratch is a hands-on workshop for building a GPT training pipeline from scratch. It focuses on reproducing the 124-million-parameter GPT-2 model in PyTorch.
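As a rough illustration of where the 124 million figure comes from, the sketch below counts parameters for a GPT-2-style decoder. It assumes the commonly published GPT-2 small hyperparameters (12 layers, model width 768, vocabulary 50257, context length 1024) and tied input/output embeddings. These values and the function name `gpt2_param_count` are assumptions for illustration, not taken from the workshop's code, which may be organized differently.

```python
def gpt2_param_count(n_layer=12, d_model=768, vocab_size=50257, n_ctx=1024):
    """Count parameters of a GPT-2-style decoder with tied embeddings.

    Hypothetical helper: the defaults are the commonly published GPT-2 small
    hyperparameters, assumed here rather than read from the workshop repo.
    """
    token_emb = vocab_size * d_model   # token embedding (shared with the output head)
    pos_emb = n_ctx * d_model          # learned position embedding
    layer_norm = 2 * d_model           # scale and bias

    # Attention: fused QKV projection plus output projection, each with bias.
    attn = (d_model * 3 * d_model + 3 * d_model) + (d_model * d_model + d_model)
    # MLP: expand to 4 * d_model, then project back, each with bias.
    mlp = (d_model * 4 * d_model + 4 * d_model) + (4 * d_model * d_model + d_model)

    block = 2 * layer_norm + attn + mlp
    # The trailing layer_norm term is the final norm after the last block.
    return token_emb + pos_emb + n_layer * block + layer_norm


if __name__ == "__main__":
    print(f"{gpt2_param_count():,}")
```

Under these assumptions the count comes to 124,439,808, which is usually rounded to 124M. Most of the total sits in the twelve transformer blocks (about 85M), with the token embedding contributing roughly 38.6M.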
Key Takeaways

Microgpt explained interactively
Mar 1, 2026

Autoresearch: Agents researching on single-GPU nanochat training automatically
Mar 7, 2026

CS336: Language Modeling from Scratch
Jun 1, 2026

Mr. Chatterbox is a Victorian-era ethically trained model
Mar 31, 2026

Nano-vLLM: How a vLLM-style inference engine works
Feb 2, 2026