Web Harvester

Web Harvester from OCLC is said to allow libraries and other cultural heritage institutions to capture and add web content to their digital collections that are managed by OCLC’s CONTENTdm Digital Collection Management Software.

This tool, which evolved from seven years of collaboration with several state libraries, state archives and universities, addresses the need to store and provide access to otherwise highly transient information resources that exist solely on web sites.

The Web Harvester is integrated into library workflows, allowing library staff to capture content as part of the cataloging process. The captured content is then sent to the organisation's digital collections where it can be managed with other CONTENTdm digital content.

The tool is accessed via the Connexion client, OCLC's cataloging service. It captures content ranging from single web-based documents to entire web sites. Harvested items are discoverable from WorldCat.org, WorldCat Local and the CONTENTdm Web interface. Master files of the captured content can also be ingested into the OCLC Digital Archive, the service for long-term storage of originals and master files from libraries' digital collections.

The Web Harvester is an optional product for current Hosting users of CONTENTdm to expand their ability to collect, manage and provide access to digital content.