A Time Machine for the Internet?

Category: Reference

Have you ever wondered what the Internet looked like 25 years ago? The Internet Archive has been crawling the Web since 1996, taking snapshots of Web pages and preserving them for posterity. Today, approximately 835 billion Web pages, 44 million books, millions of images, audio recordings, and video, along with a million games and software programs are stored on the IA’s servers. Read on to learn how you can access the Wayback Machine, an amazing resource that offers a window into both the history and the present of the Internet...

What is The Internet Archive and Wayback Machine?

The Internet Archive was founded by Brewster Kahle, an Internet pioneer and computer programmer who helped to develop WAIS (Wide Area Information System), a text-based precursor to the World Wide Web. Kahle and others founded WAIS, Inc., to commercialize the text-searching technology; their clients included Ross Perot’s 1992 Presidential campaign, the EPA, the Library of Congress, the Dept. of Energy, the Wall Street Journal, and Encyclopedia Britannica.

WAIS, Inc., was sold to AOL in 1995 (which is why you've probably never heard of it) and Kahle went on to found The Internet Archive and the Alexa web stats service (not to be confused with Amazon’s Alexa virtual assistant).

The front-end to this massive library is the Wayback Machine (which fans of Dr. Peabody and Sherman will recognize). It allows journalists, researchers, and the nostalgically curious to search for older versions of Web pages, even if the pages no longer exist on the Web. If you want to see what Yahoo.com looked like in October 1996, or view snapshots of WhiteHouse.gov over time, it's in there.

The Wayback Machine

It also allows one to submit a web page address (URL) for archiving, and get a URL that will work even if the page is deleted or moved from its original site. These permanent links are increasingly important. Web URLs have gained widespread acceptance as citations in students’ term papers, Ph. D. dissertations, scientific research publications, even court filings and opinions. A “404 - not found” error is a big deal in a legal document, and the Wayback Machine helps avoid such problems. The Wayback Machine can search for archived copies of a missing page given its now-errant URL.

To make finding a lost page even easier, a browser extension is available for Chrome, or Edge, and an addon for Firefox. Once installed, it automatically searches the IA every time you run into one of the various "page not found" errors your browser may return when you try to fetch a web page. (In tech terms, that would be an error number 404, 408, 410, 451, 500, 502, 503, 504, 509, 520, 521, 523, 524, 525, or 526). If archived copies of the page are found, a notification window lets you choose whether to explore them.

But Wait, There's More!

Headquarters of Internet Archive, located in the former Fourth Church of Christ, Scientist, a neoclassic building with Greek columns on Funston Avenue, in Richmond District, San Francisco, California

The Internet Archive isn't just about Web pages, though. As part of its lofty goal "to provide Universal Access to All Knowledge," the folks at IA are busily scanning books into its databases, much like Google Books does. It also preserves copies of old video games (and the emulators need to play an Atari game on a PC), software, music, movies, videos, and even animated GIFs.

Some IA rabbit holes you may want to explore are the Internet Archive TV News, which has clips from almost 3 million TV news shows since 2009; the Movies page, which has full-length feature films, classic shorts and movie trailers; and the Cover Art Archive which has music album cover art images.

It's worth your time to browse the "Top Collections at the Archive," where you'll fund curated collections related to a wide variety of interests including Old Time Radio, MS-DOS Games, old magazines, and dozens of esoteric topics.

The headquarters of Internet Archive are located in the former Fourth Church of Christ, Scientist, a neoclassic building with Greek columns on Funston Avenue, in Richmond District, San Francisco, California. As of September 2024, the IA held over 145 petabytes of data. A petabyte is one million gigabytes. Wow.

One thing that was new since my last visit was Electric Sheep, a collection of animated and evolving fractal flames that make great screensavers. Let me know what you find there!

Your thoughts on this topic are welcome. Post your comment or question below.

 
Ask Your Computer or Internet Question

 
  (Enter your question in the box above.)

It's Guaranteed to Make You Smarter...

AskBob Updates: Boost your Internet IQ & solve computer problems.
Get your FREE Subscription!


Email:

Check out other articles in this category:



Link to this article from your site or blog. Just copy and paste from this box:

This article was posted by on 1 Oct 2024


For Fun: Buy Bob a Snickers.

Prev Article:
So You Dropped Your Phone in the...

The Top Twenty
Next Article:
Yes, You Do Need a Disposable Email Address

Most recent comments on "A Time Machine for the Internet?"

Posted by:

Doug
01 Oct 2024

I've used the Wayback Machine, which saved an entire site with 300 to 400 biographies and pictures.


Posted by:

Ian
01 Oct 2024

I wish the internet had not developed and stayed where it was 25 years ago. I don't like CSS; and although I like Javascript as a programming language, I can't stand it on websites. I also prefer plain text emails. In fact my website has an HTML only version but I am wary of using it in case people think I am computer illiterate.

And as for the days of pointless video art taking 40 seconds to move across your screen - a.k.a. Flash - thank goodness that is now extinct.

Maybe that my favourite author is Charlotte Brontë explains this.


Posted by:

John
02 Oct 2024

Bob - can you do a follow up piece on adding to the Archive.

By way of background - pushing the 'Update' button on a WordPress site recently wiped out the 'Theme', changing the site appearance for the worse. An early attempt to explain this to Web Support involved looking for dated content in the Archive. It wasn't there, though the site was 3-4 years old.

Is there a way to trigger the Archive to keep a copy of a page - or site - your own or someone else's?


Posted by:

FrancesMC
04 Oct 2024

And don't forget to donate!


Posted by:

BillK
06 Oct 2024

Yes, do not forget to donate; 145 petabytes and counting is not cheap.


Posted by:

Cole
06 Oct 2024

Thanks for writing about these. They are really handy in restoring some old blog you've run years ago.


Posted by:

Pete
09 Oct 2024

My wife and I have been on a kick of watching classic movies and the IA has often had quality copies of them. Works best to download fully, then watch on the device or have it screenshare to a smart TV. I try to remember to delete after watching because they can be somewhat large files. Next, wanting to see a couple of the oldest Marx Brothers films. Groucho and the rest.

Thankful.

I've looked at URLs on certain past dates with mixed success. Either way, it's an incredibly impressive data collection!

Oh. Sometimes, it's not a bad idea to stream a couple minutes of a download first. Rare but we have waited for a download only to find that it is in a different language or recorded in a noisy environment. The reason I download is: often when screen sharing while streaming I've had pixilation or choppy pic. Possibly you have superior speed or devices so it may not be necessary.


Post your Comments, Questions or Suggestions

*     *     (* = Required field)

    (Your email address will not be published)
(you may use HTML tags for style)

YES... spelling, punctuation, grammar and proper use of UPPER/lower case are important! Comments of a political nature are discouraged. Please limit your remarks to 3-4 paragraphs. If you want to see your comment posted, pay attention to these items.

All comments are reviewed, and may be edited or removed at the discretion of the moderator.

NOTE: Please, post comments on this article ONLY.
If you want to ask a question click here.


Free Tech Support -- Ask Bob Rankin
Subscribe to AskBobRankin Updates: Free Newsletter

Copyright © 2005 - Bob Rankin - All Rights Reserved
About Us     Privacy Policy     RSS/XML


Article information: AskBobRankin -- A Time Machine for the Internet? (Posted: 1 Oct 2024)
Source: https://askbobrankin.com/a_time_machine_for_the_internet.html
Copyright © 2005 - Bob Rankin - All Rights Reserved