Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Full page archival

Does it grab the DOM from my browser as it sees it? Or is it a separate request? If so, how does it deal with authentication?



So there are different ways it archives a webpage.

It currently stores the full webpages as a single html file, a screenshot, a pdf, a read-it-later view.

Aside from that, you can also send the webpages to the Wayback Machine to take a snapshot.

To archive pages behind a login or paywall, you can use the browser extension, which captures an image of the webpage in the browser and sends it to the server.


> To archive pages behind a login or paywall, you can use the browser extension, which captures an image of the webpage in the browser and sends it to the server.

Just an image? So no full text search?


> To archive pages behind a login or paywall, you can use the browser extension, which captures an image of the webpage in the browser and sends it to the server.

It'd be awesome to integrate this with the SingleFile extension, which captures any webpage into a self-contained HTML file (with JS, CSS, etc, inlined).


We might add this, it's actually highly suggested by the users :)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: