How Much of the Web Is Archived? Truth Is, We Don’t Really Know

Here’s the challenge: new Internet is being made all the time. Oftentimes, these new pages are added to existing networks on Tumblr or Facebook or Twitter or Livejournal. But other times, someone fires up a web server that’s off the standard map, and it the web’s crawlers, try as they might, may not find that page for a while, if ever.That means some percentage of the web is not being archived by anyone (or anything, really), not even the Internet Archive’s invaluable Wayback machine.
http://www.theatlantic.com/technology/archive/2013/01/how-much-of-the-web-is-archived-truth-is-we-dont-really-know/266905/

Leave a Reply

Your email address will not be published.

This site uses Akismet to reduce spam. Learn how your comment data is processed.