Skip to content

WebCite: An On-Demand Internet Archive

As someone who studies Internet culture, one of my biggest problems is “link rot,” or broken links.  I’m a big fan of the Internet Archive, but they are usually six to eight months behind on even the most popular sites.  I also applaud sites like Wikipedia for providing stable version histories so that I can point to a specific revision of a page.  However, for all other websites, the only option is self-archiving, which is technically difficult and fraught with problems.  What I have found incredibly useful is WebCite, a free webpage archiving service that fills in this gap.

The process is incredibly easy.  You submit a URL with your e-mail and a few optional pieces of metadata, and WebCite will permenantly archive that URL.  For people who have a massive list of links they need to archive (like me), WebCite lets you upload an HTML file - all the anchor tags will be archived.   It does well with text, images are hit and miss, and plugins like flash are not supported.  Also, some websites (like the New York Times and CNN) have Javascript-based advertising redirects or anti-framing measures that make archiving impossible.

Still, it is better than nothing.  My standard citation practice for all sites is to search the Internet Archive first, and then use WebCite if I do not find the page I need.  It also provides a layer of accountability, as the header for each archived page shows the URL and when it was archived.  I’m sure there is some way to fool the site into archiving the wrong URL, but it is better than self-archiving.

WebCite is funded by a consortium headquartered at the University of Toronto, and they plan on making money through grants and institutional and subscriptions.  I’m a bit skeptical of this business model, but I guess it works for the Stanford Encyclopedia of Philosophy.

Share and Enjoy:
  • Digg
  • del.icio.us
  • Facebook
  • Mixx
  • Google
  • Reddit
  • Technorati
  • Ma.gnolia
  • StumbleUpon
Creative Commons LicenseGNU Free Documentation License 1.2 or later
The WebCite: An On-Demand Internet Archive by R. Stuart Geiger, unless otherwise expressly stated, is licensed under a Creative Commons Attribution-Share Alike 3.0 United States License. This work is also licensed under the GNU Free Documentation License 2.0 or greater, see http://www.gnu.org/copyleft/fdl.html.

2 Comments

  1. Olav A. wrote:

    It would be great if, say, the links in my Delicious account could be automatically saved to a service like this one and the whole content of the pages archived. Link rot is not the only problem, pages disappearing completely - being taken down - is also bad.

    Tuesday, August 26, 2008 at 1:58 pm | Permalink
  2. R. Stuart Geiger wrote:

    I completely agree. It shouldn’t be that hard to automate it - the service is just a POST form. I wish this would integrate with Zotero - a great bibliographic tool that automatically archives websites on your computer.

    Saturday, August 30, 2008 at 9:03 am | Permalink

Post a Comment

Your email is never published nor shared. Required fields are marked *
*
*