SWAT - Snappy Web Archiving Tool


SWAT is a tool for archiving web sites and presenting the archive in a simple, pedagogical way. Besides harvesting the original files from the web site, SWAT renders a snapshot of each page as a TIFF file and describes the entire archive in a METS file. Because the snapshots can be generated with an arbitrary rendering engine (WebKit, Trident, Gecko, etc.), future generations can see what a page looked like without having to render the HTML themselves. The system is written in Ruby and is accessed through a web application built with the Rails framework.
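
To make the idea of "describing the archive in a METS file" more concrete, here is a minimal sketch in plain Ruby. It is not SWAT's actual code: the `ArchivedPage` struct, the `mets_manifest` and `xml_escape` helpers, the `ORIG`/`SNAP` ID scheme, and the file paths are all assumptions made for illustration. It only shows how harvested files and their TIFF snapshots could be listed in a METS `fileSec` and tied together per page in a `structMap`.

```ruby
# Hypothetical record for one archived page (not SWAT's actual data model).
ArchivedPage = Struct.new(:url, :original_path, :snapshot_path)

XML_ESCAPES = { "&" => "&amp;", "<" => "&lt;", ">" => "&gt;", '"' => "&quot;" }.freeze

# Escape the characters that are unsafe inside XML attribute values.
def xml_escape(text)
  text.to_s.gsub(/[&<>"]/, XML_ESCAPES)
end

# Build a minimal METS document that lists each page's harvested file and
# TIFF snapshot, and groups the two under one <div> per page.
def mets_manifest(pages)
  originals = []
  snapshots = []
  divs = []

  pages.each_with_index do |page, index|
    n = index + 1 # assumed ID scheme: ORIG1/SNAP1, ORIG2/SNAP2, ...
    originals << %(<file ID="ORIG#{n}"><FLocat LOCTYPE="URL" xlink:href="#{xml_escape(page.original_path)}"/></file>)
    snapshots << %(<file ID="SNAP#{n}" MIMETYPE="image/tiff"><FLocat LOCTYPE="URL" xlink:href="#{xml_escape(page.snapshot_path)}"/></file>)
    divs << %(<div TYPE="page" LABEL="#{xml_escape(page.url)}"><fptr FILEID="ORIG#{n}"/><fptr FILEID="SNAP#{n}"/></div>)
  end

  <<~XML
    <?xml version="1.0" encoding="UTF-8"?>
    <mets xmlns="http://www.loc.gov/METS/" xmlns:xlink="http://www.w3.org/1999/xlink">
      <fileSec>
        <fileGrp USE="original">
          #{originals.join("\n      ")}
        </fileGrp>
        <fileGrp USE="snapshot">
          #{snapshots.join("\n      ")}
        </fileGrp>
      </fileSec>
      <structMap>
        <div TYPE="website">
          #{divs.join("\n      ")}
        </div>
      </structMap>
    </mets>
  XML
end

# Example usage with made-up paths.
example_pages = [
  ArchivedPage.new("http://example.org/", "files/index.html", "snapshots/index.tiff")
]
puts mets_manifest(example_pages)
```

Keeping originals and snapshots in separate file groups, and linking them through one `div` per page, lets a reader of the METS file find the rendered image for any harvested page without knowing how the archive is laid out on disk.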

The workflow in SWAT consists of these steps: