
Internet Archive to unveil massive Wayback Machine data center

The Wayback Machine stores 85 billion Web pages dating back to '96

March 19, 2009 (Computerworld) The Internet Archive organization plans next week to announce the opening of a new data center to house two petabytes of information for its Wayback Machine, the digital time capsule that stores archived versions of Web pages dating back to 1996.

For example, the archive holds snapshots of Computerworld's Web site as it looked in 1997, Google as it looked in 1998 and CNN as it looked in 2000.

The Wayback Machine houses 85 billion Web pages archived over more than a dozen years, which amounts to three petabytes of data, or about 150 times the content of the Library of Congress. Only five years ago, the Wayback Machine contained about 30 billion Web pages. The archive is expected to continue to grow by 100TB of data per month now that the new data center is live.
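
To put those figures in perspective, the short Python sketch below works through the arithmetic they imply. It is a back-of-the-envelope illustration only: it assumes decimal units (1PB = 10^15 bytes, 1TB = 10^12 bytes) and a constant growth rate, neither of which is stated in the article, and the function names are made up for this example.

```python
# Figures quoted in the article.
PAGES_NOW = 85_000_000_000          # Web pages archived today
PAGES_FIVE_YEARS_AGO = 30_000_000_000
ARCHIVE_BYTES = 3 * 10**15          # three petabytes (decimal units assumed)
GROWTH_BYTES_PER_MONTH = 100 * 10**12  # 100TB per month (decimal units assumed)


def average_page_bytes(total_bytes: int, pages: int) -> float:
    """Average stored size per archived page, in bytes."""
    return total_bytes / pages


def months_to_add(amount_bytes: int, growth_per_month: int) -> float:
    """Months needed to add amount_bytes at a constant monthly growth rate."""
    return amount_bytes / growth_per_month


def pages_added_per_year(pages_now: int, pages_before: int, years: int) -> float:
    """Average number of pages added per year over the given span."""
    return (pages_now - pages_before) / years


if __name__ == "__main__":
    avg_kb = average_page_bytes(ARCHIVE_BYTES, PAGES_NOW) / 1000
    print(f"Average size per page: about {avg_kb:.1f} KB")
    print(f"Months to add another 3PB: {months_to_add(ARCHIVE_BYTES, GROWTH_BYTES_PER_MONTH):.0f}")
    print(f"Pages added per year: {pages_added_per_year(PAGES_NOW, PAGES_FIVE_YEARS_AGO, 5):,.0f}")
```

Under those assumptions, the archive averages roughly 35KB per stored page, would take about 30 months to add another three petabytes, and has grown by about 11 billion pages a year over the past five years.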

The Internet Archive's massive database is mirrored to the Bibliotheca Alexandrina, the new Library of Alexandria in Egypt, for disaster recovery purposes.

According to an event invitation from Sun Microsystems Inc., the Internet Archive is moving from a traditional data center filled with standard Linux servers to one that runs Solaris 10 with ZFS on Sun Fire X4500 servers inside a Sun Modular Datacenter. The modular system is an all-in-one data center housed in a metal shipping container for mobility.

Because of the modular design, Sun said the data center was deployed in a tenth of the time it would take to build a typical bricks-and-mortar data center. The Wayback Machine Sun Modular Datacenter can service 500 inquiries a second, Sun said. A spokesperson for the Internet Archive said the user interface on the Wayback Machine will not change.

The Internet Archive is a nonprofit organization located in the Presidio in San Francisco, with data centers in Redwood City and Mountain View, Calif. The archive not only keeps snapshots of Web pages, but also software, movies, books, and audio clips.

Users can surf the Wayback Machine by typing in the address of a Web site or Web page and then choosing from a series of dates for which archived copies are stored. The site does not currently support keyword search.
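
The address-plus-date lookup described above can also be expressed as a URL. The Python sketch below builds one, assuming the commonly seen pattern of a `web.archive.org/web/` prefix followed by a 14-digit timestamp (year, month, day, hour, minute, second) and the original address. The article does not describe this pattern, and the helper name `snapshot_url` is hypothetical.

```python
from datetime import datetime

# Assumed base address for archived snapshots; not stated in the article.
WAYBACK_PREFIX = "https://web.archive.org/web"


def snapshot_url(page_url: str, when: datetime) -> str:
    """Build the address of an archived copy of page_url captured at `when`.

    The timestamp is formatted as 14 digits: YYYYMMDDhhmmss.
    """
    timestamp = when.strftime("%Y%m%d%H%M%S")
    return f"{WAYBACK_PREFIX}/{timestamp}/{page_url}"


if __name__ == "__main__":
    print(snapshot_url("http://www.computerworld.com/",
                       datetime(2009, 3, 23, 9, 30, 2)))
```

A sketch like this only assembles the address. Whether a copy exists for a given date still depends on what the archive actually captured.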


