Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
pronoiac
6 months ago
|
parent
|
context
|
favorite
| on:
1 Trillion Web Pages Archived
The Archive Team - not part of the Internet Archive - worked on a distributed backup of a portion of the Internet Archive -
https://wiki.archiveteam.org/index.php/INTERNETARCHIVE.BAK
It's been dormant / on hiatus for a few years now.
smallerize
6 months ago
[–]
That can only cover other collections though, because the WARC files from the Wayback Machine web scrapes are not public.
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
It's been dormant / on hiatus for a few years now.