web.archive.org

Man of Honour
Man of Honour
Joined
3 May 2004
Posts
17,722
Location
Kapitalist Republik of Surrey
How does this site work then? It's the one where you can put in a website and it will bring back all the historical pages from that URL. Where is it all stored and how does it find historical images that aren't on the servers any more? How does it know a site has been changed? I remember once I used it and the page that came back was a bare html shell with no images. Now it's got 90% of the old images on it.

Edjumacate me :)
 
It uses a crawling bot to periodically crawl all of the websites in its database and download the lot. It's easy enough to tell if somethings changed when it's all stored on your system...
 
Bargains!!! :)

Yup, does much the same as Google, but Google stores less content. I don't even want to know how much disk space they have - petabytes I'm sure, and if it's not getting close to an exabyte yet, I suspect it soon will be.

Edit - claimed 2 petabytes but I suspect that's slightly out of date. The exabyte is a way off then but I'm sure they're working on it. :)
 
Last edited:
Back
Top Bottom