Self-distillation (SSD) enables large language models to enhance code generation by using their own raw outputs without the need for a verifier or teacher model. The process involves sampling solutions with specific temperature and truncation settings, followed by fine-tuning.
arxiv.org
2 min
4/4/2026
Self-distillation (SSD) enables large language models to enhance code generation by using their own raw outputs without the need for a verifier or teacher model. The process involves sampling solutions with specific temperature and truncation settings, followed by fine-tuning.
arxiv.org
2 min
4/4/2026
Self-distillation (SSD) enables large language models to enhance code generation by using their own raw outputs without the need for a verifier or teacher model. The process involves sampling solutions with specific temperature and truncation settings, followed by fine-tuning.
arxiv.org
2 min
4/4/2026
No more articles to load