Model spec midtraining (MSM) trains models on synthetic documents about their Model Spec after pre-training and before alignment fine-tuning. MSM enhances generalization during alignment training and reduces agentic misalignment by influencing how models adopt values based on the Model Spec used.
alignment.anthropic.com
8 min
2d ago
Model spec midtraining (MSM) trains models on synthetic documents about their Model Spec after pre-training and before alignment fine-tuning. MSM enhances generalization during alignment training and reduces agentic misalignment by influencing how models adopt values based on the Model Spec used.
alignment.anthropic.com
8 min
2d ago
Model spec midtraining (MSM) trains models on synthetic documents about their Model Spec after pre-training and before alignment fine-tuning. MSM enhances generalization during alignment training and reduces agentic misalignment by influencing how models adopt values based on the Model Spec used.
alignment.anthropic.com
8 min
2d ago
No more articles to load