
arxiv.org
May 18, 2026
2 min read
48/100
Summary
Computer Science > Computation and Language [Submitted on 15 Jan 2026 (v1), last revised 19 Feb 2026 (this version, v2)] Title:Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment View PDF HTML (experimental)Abstract:Pretraining corpora contain extensive discourse about AI systems, yet the causal influence of this discourse on downstream alignment remains poorly understood. If...