Digital Archiving Revived With Archive-It 2.0, New Version of Web Archiving Service


This news release announces Archive-It 2.0, a new version of Internet Archive's Archive-It service. Archive-It 2.0 helps libraries, museums and other memory institutions preserve online material. This news is intended for all users of the World Wide Web who search the internet for archived information.

As information is increasingly created, communicated and consumed digitally, memory institutions such as archives, libraries and museums face the daunting task of preserving endless quantities of web material. Internet Archive has recognized this challenge and today released Archive-It 2.0, the latest version of its subscription-based archiving service. Serving a broad range of scholarly organizations at a cost significantly lower than other archive platforms, Archive-It 2.0 enables subscribers to collect, manage, search and preserve online material from their own institutions’ websites as well as from the wider World Wide Web.

Archive-It 2.0 enables digital archivists and library and museum professionals to create more tailored, relevant and search-friendly collections of up to ten million URLs based on regular Web crawls across selected websites. Through “test crawls,” subscribers may see what kind of web material would populate a certain collection before actually archiving it permanently. An optional paid feature within Archive-It 2.0, Archive-It Pro, allows subscribers not only to set caps on how many web documents are collected from a website over time, but also to block the collection of materials from specific websites altogether. As a result, the digital collections are more focused and more easily managed, because irrelevant materials do not find their way into an institution’s archives.

Existing partners include the University of Toronto, Indiana University and North Carolina State Archives, alongside other respected academic and memory institutions that have capitalized on the preassembled archiving infrastructure that Archive-It offers. Collections seeded and managed by these institutions through Archive-It 2.0 cover topics ranging from national government statistics websites to Hurricane Katrina and many more, and all are open for public exploration and use.

“Archive-It is a great service for the comprehensive capture of websites based on the collecting needs of libraries,” says Carolyn Palaima of University of Texas Libraries & LANIC. She adds, “Archive-It has also opened new possibilities for collaborative collecting.”

About Internet Archive

Internet Archive is a 501(c)(3) non-profit organization headquartered in San Francisco, California, with facilities also in Amsterdam and Alexandria, Egypt. It was founded in 1996 with a mission to build an “Internet library” providing permanent access for researchers, historians, and scholars to historical collections that exist in digital form. Built on open source software developed by Internet Archive and the International Internet Preservation Consortium, Internet Archive is the largest publicly available web archive in existence. It currently archives 65 billion pages from 50 million websites worldwide, and its collections include texts, audio, moving images, and software as well as archived web pages. In 2002, Internet Archive began to form partnerships with the Library of Congress, NARA, and other national libraries, including those in Australia, France and Italy.

Media and Analyst Contact:

Lara Roubinian

Antenna Group


