News

News publishers limit Internet Archive access due to AI scraping concerns

  • Andrew Deck--Niemanlab.org
  • published date: 2026-01-28 20:09:59 UTC

As part of its mission to preserve the web, the Internet Archive operates crawlers that capture webpage snapshots. Many of these snapshots are accessible through its public-facing tool, the Wayback Machine. But as AI bots scavenge the web for training data to…

As part of its mission to preserve the web, the Internet Archive operates crawlers that capture webpage snapshots. Many of these snapshots are accessible through its public-facing tool, the Wayback M… [+11500 chars]