
niemanlab.org
May 21, 2026
11 min read
62/100
Summary
In January, Nieman Lab broke the story that major news publishers — including The New York Times, The Guardian, and USA Today Co. — had started blocking the Internet Archive due to concerns that AI companies might scrape the nonprofit’s repositories for training data. No news publisher has confirmed to Nieman Lab that an AI company has already scraped their content from the Wayback Machine. Still,...