Hi Christine,

I studied this issue quite a bit over the last year, and many archivists in
North America use HTTrack (http://www.httrack.com/). It is free and open
source. However, it depends on what you want to archive and why. Do you need
to preserve the look and feel of the site along with the content, or just
the look and feel? As Elizabeth mentioned, Flash and some other aspects of a
web site can be problematic for harvesting, so some testing might be needed
to find the right program for you.
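
If it helps to see the moving parts, here is a rough sketch in Python of the
kind of capture-then-commit routine your developer describes. It is only an
illustration under assumptions, not a tested procedure: the function names,
the example.org address, and the "site-archive" folder are all hypothetical,
and it assumes wget (or httrack) and svn are installed and that the folder is
already a Subversion working copy.

```python
import subprocess
from pathlib import Path


def wget_mirror_command(site_url: str, dest: Path) -> list[str]:
    """Build a wget command that mirrors the public site into dest."""
    return [
        "wget",
        "--mirror",            # recursive download with timestamping
        "--page-requisites",   # also fetch images, CSS and scripts
        "--convert-links",     # rewrite links so the copy browses offline
        "--adjust-extension",  # save pages with an .html extension
        "--no-parent",         # stay at or below the starting address
        f"--directory-prefix={dest}",
        site_url,
    ]


def httrack_mirror_command(site_url: str, dest: Path) -> list[str]:
    """Build the equivalent HTTrack command (-O sets the output folder)."""
    return ["httrack", site_url, "-O", str(dest)]


def svn_commit_commands(working_copy: Path, message: str) -> list[list[str]]:
    """Build the svn commands that record a new snapshot.

    Subversion stores only the differences between commits, which gives the
    "base site plus incremental changes" effect. Pages that have disappeared
    from the site are not handled here and would need a separate svn delete.
    """
    return [
        ["svn", "add", "--force", str(working_copy)],  # pick up new files
        ["svn", "commit", "-m", message, str(working_copy)],
    ]


def capture_snapshot(site_url: str, working_copy: Path, message: str,
                     use_httrack: bool = False) -> None:
    """Mirror the site, then commit whatever changed since the last run."""
    if use_httrack:
        mirror = httrack_mirror_command(site_url, working_copy)
    else:
        mirror = wget_mirror_command(site_url, working_copy)
    subprocess.run(mirror, check=True)
    for command in svn_commit_commands(working_copy, message):
        subprocess.run(command, check=True)


if __name__ == "__main__":
    # Hypothetical address and folder; substitute your own.
    capture_snapshot("http://example.org/", Path("site-archive"),
                     "Scheduled web site snapshot")
```

Running something like this on a schedule would give you one Subversion
revision per capture. Whichever harvester you pick, it is worth browsing the
captured copy offline to check how much of the look and feel survived.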

Cheers

Lisa
Lisa Snider
MAS/MLIS Student at UBC SLAIS
Senior Web Developer TSW



On Tue, Apr 12, 2011 at 6:32 AM, Christine Martin <[log in to unmask]> wrote:

>
>
> One of the organizations I work for (a private foundation in Chicago) is
> trying to decide whether (and how) to digitally preserve (or "archive") its
> web site.
>
>
>
> The foundation does not take money or handle financial transactions over
> its web site.  The web site contains primarily publications, e.g., news
> releases, annual reports, newsletters, and the like.
>
>
>
> My question is:  What software or procedures do you use to preserve your
> organization's web site as it changes over time?  Our web developer has
> suggested that we use WGet (a web crawler) to capture our web site as it
> appears to the public and then use SubVersion (another software product,
> used for version control?) to catalog any aspects of the web site that have
> changed.  In this way, we store only the base web site plus incremental
> changes, as opposed to storing multiple copies of portions of the web site
> that have not changed.
>
>
>
> Have any of you done (or attempted) anything similar?  If so, I would love
> to hear what you did and how it went.  This is fairly new territory to me,
> and any words of advice, warning, or encouragement would be most welcome.
>
>
>
> Thank you.
>
>
>
> Sincerely,
>
>
>
> Christine Martin
>
> Contract records manager
>
> Des Plaines, IL
>
> 224-636-2457 (cellular)
>
>
>
>
>
>
> List archives at http://lists.ufl.edu/archives/recmgmt-l.html
> Contact [log in to unmask] for assistance
> To unsubscribe from this list, click the below link. If not already
> present, place UNSUBSCRIBE RECMGMT-L or UNSUB RECMGMT-L in the body of the
> message.
> mailto:[log in to unmask]
>
