[GH-ISSUE #715] Archiving web design #3468

Closed
opened 2026-03-14 23:04:23 +03:00 by kerem · 2 comments
Owner

Originally created by @raphaelbastide on GitHub (Apr 19, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/715

I am trying to archive websites for their design characteristics, my goal is to have a copy containing every asset in order to replicate exactly the graphic characteristics of a page (CSS, fonts, images…). I am wondering if ArchiveBox is the tool I need.

After a first tests, issues appear for fonts and ajax calls (or image lazy loading): When I try to archive this web page for instance with all archive methods selected, none of the result fits to my goal. The fonts seams to be downloaded for singlefile only and the images appears to be not downloaded at all, in fact, they keep their original URL. Do you have any tips for me to archive this kind of page?

Other website causing similar issues

Originally created by @raphaelbastide on GitHub (Apr 19, 2021). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/715 I am trying to archive websites for their design characteristics, my goal is to have a copy containing every asset in order to replicate exactly the graphic characteristics of a page (CSS, fonts, images…). I am wondering if ArchiveBox is the tool I need. After a first tests, issues appear for fonts and ajax calls (or image lazy loading): When I try to archive this [web page](http://fondsinternational.com/) for instance with all archive methods selected, none of the result fits to my goal. The fonts seams to be downloaded for singlefile only and the images appears to be not downloaded at all, in fact, they keep their original URL. Do you have any tips for me to archive this kind of page? Other website causing similar issues - http://www.guydecointet.org - http://edwardthomson.net/ - http://www.dougkoellmer.com/
kerem 2026-03-14 23:04:23 +03:00
Author
Owner

@pirate commented on GitHub (Apr 19, 2021):

Looks like the fonts are saved: https://demo.archivebox.io/archive/1618859543.748595/fondsinternational.com/wp-content/themes/Empire%20II/assets/fonts/

Wget gets them, so you'll find them in the wget output folder ./archive/<timestamp>/domainhere.com/.../fonts.

You may prefer https://ArchiveWeb.page and https://ReplayWeb.page for higher fidelity archives though.

<!-- gh-comment-id:822720107 --> @pirate commented on GitHub (Apr 19, 2021): Looks like the fonts are saved: https://demo.archivebox.io/archive/1618859543.748595/fondsinternational.com/wp-content/themes/Empire%20II/assets/fonts/ Wget gets them, so you'll find them in the wget output folder `./archive/<timestamp>/domainhere.com/.../fonts`. You may prefer https://ArchiveWeb.page and https://ReplayWeb.page for higher fidelity archives though.
Author
Owner

@raphaelbastide commented on GitHub (Apr 20, 2021):

Thank you for your answer @pirate!

<!-- gh-comment-id:823077190 --> @raphaelbastide commented on GitHub (Apr 20, 2021): Thank you for your answer @pirate!
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#3468
No description provided.