{% extends "base.html" %} {% block overhead %}{% endblock %} {% block path %}{% endblock %} {% block pagetitle %}{% endblock %} {% block postnav %}

Internet Archive is a non-profit library holding millions of books, websites, recordings, and other digitized works.

Since 1996 our mission has been to ensure Universal Access to All Knowledge.

Wayback Machine is a great project for sure! We have been saving copies of popular web pages on a regular basis since 1996, allowing anybody to travel back in time (be it weeks or decades) and view web content as originally posted.

Open Library is an amazing gift to humanity! We have collected and/or scanned millions of books which are free borrow or download without restriction.

Amazing! We've been recording live television since the year 2009 or so, and have many hours of footage that can be viewed right now in the browser!

Coming soon: radio content

Yup, we've even got music too. Like tons of early American 78 RPM records that we've digitized with George Blood, and thousands of hours of live concert recordings made by fans. Listen up! You can stream or download for offline access.

Oh right, film and video, we have a bunch of that also! Like the Prelinger Archives of historical Bay Area street content, tons of educational and government reels, and complete out-of-copyright early films. For example, check out this anti-fascist reel.

If you thought we might also have thousands of programs you can run right now in your browser, including hypercard classics, retro games, and more, you're correct! Also copies of old mainframe software, contemporary open source projects, and CD-ROMs.

And that's definately not all. We've got millions of scholarly papers, complete database dumps of popular websites, a bunch of big photo collections, etc.

GET EXCITED!

{% endblock %} {% block content %}

Featured Content: September 2017

Headquarters Photo

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

Upcoming Events at Headquarters

Headquarters Photo

300 Funston Ave, Inner Richmond

San Franscisco, CA, USA

  • October 11th: Annual Event
  • More Film Screenings on Sundays
  • Free Lunch every Friday

Keep in Touch!

From the Blog

  • The world is flooding and we won't forget
  • Why the largest democracy in the world blocked our domain
  • What we're doing with robots.txt these days

Twitter (inevitably)

Subscribe to Newsletter:

What Does Internet Archive Do?

Scan Lots of Books

We built our own scanning hardware and digital workflow. Have dozens of workers in centers world-wide. Work with partners, have XYZ books so far.

Record The News

News is important. Who said what? We have been recording since 2009 and have a lot of it, Search by captions, see what you can find, hold everybody accountable! Check these examples.

Crawl The Web

Web is flux-y, always disapearing, big 20th/21st century cultural heritige. We use Heritrix to crawl the web a whole bunch, get big numbers of pages and petabytes.

Some partners submit data or pay, some use Archive-It.

All results available via Wayback Machine.

Archive a page now:

Petabox Datacenter

We are pretty in to storing data for a very long time. Own our own real-estate, hardware, etc instead of using cloud storages. Save money, waste heat building, have a couple sites, lots of disks, great good, some details.

Physical Archives

Physical artifacts are cool as well, so we have a bunch of that over in the east bay. XYZ tons! Neato!

Partnerships and Sustainability

Archive-It Partners

We can't decide everything to crawl, so partner with universities, libraries, other organizations, who provide funding and lists of what to crawl, then we go and do it, archive forever, usually provide as part of wayback.

Over a petabyte so far, thousands of partners.

ArchiveTeam

Have a great and unique relationship with the radical self-organized Archive Team group, which goes out and saves the web. Horray! Eg, 301works, URLteam,

Archival Organizations

Sometimes other archives (like Prelinger, George Blood, etc) work with us to do digitization and physical archiving. Great content, big win!

Alexa Search Engine

We go way back (get it?) with the Alexa search engine. In the early days they were most of our content, and still provide a bunch. Thanks!

Governments

We're in the USA and have done some crawling with Library of Congress and National Records (NARA).

Also work with national libraries and governments from around the world!

Foundation Grants

Here is where we name-drop the big private foundations that often fund feature development and high-impact projects. Thanks!

Researchers

Want to use our huge set of data in a research context? Great. Contact us.

Online Communities

It's great when we find online communities which have already organized content. Sometimes a big collection, sometimes user-generated like stack overflow or reddit.

"Lock it open"!

Donations

Funded by viewers like you!

Also accept some equipment, and you can volunteer.

Contributors Like You!

You can volunteer, or just start uploading random furry convention photos from your laptop right now!. Or download and make copies, use our APIs, whatever!

Create an account

Please read our terms, don't get us sued or raided, don't melt our servers or consume thousands of dollars of disk without asking first.

{% endblock %}