
vas-blog.pages.dev
May 19, 2026
63 min read
48/100
Summary
A mechanistic-interpretability study of Qwen 3.5 Disclaimer. This is a mechanistic-interpretability study of how nation-state-mandated content filtering actually gets built into a deployed LLM's weights. It's not meant to support or oppose political censorship, and it takes no position on the historical events, policies, or governments referenced in the prompts. Readers in mainland China should fo...