mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-26 09:36:01 +03:00
[GH-ISSUE #166] Archive Interactive Site #116
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#116
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @diego898 on GitHub (Mar 9, 2019).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/166
Describe the bug
I am trying to save a full working copy of the interactive site https://distill.pub/2019/activation-atlas/ but when I try
echo "https://distill.pub/2019/activation-atlas/" | ./archivethe resulting archive only preserves the text. Reading the documentation, the default should save everything.@pirate commented on GitHub (Mar 11, 2019):
It's quite a complex site to save, but it does show up properly in the PDF/Screenshot output in my test:
As this is not specifically related to this site, but is a general problem with archiving complex, interactive content, I'm closing it in favor of more specific issues with individual archive methods.
You can track our progress improving a few pieces of interactive site archiving here:
@diego898 commented on GitHub (Mar 11, 2019):
@pirate thanks for the response - is it a goal/within-scope for this project to try and save a site as complex as that?
@pirate commented on GitHub (Mar 11, 2019):
Yes @diego898, it's 100% in-scope. We want to be able to save every website you can view online with perfect fidelity. It's a goal shared by many of the other tools as well, from Archive.org to Webrecorder.io, we all care about getting it right ;)
This is a good set of tests if you're interested in seeing which tools are able to archive interactive sites: http://acid.matkelly.com
@diego898 commented on GitHub (Mar 20, 2019):
hey @pirate - sorry I don't know much about this. How exactly do I "use" or "test" using those sets of tests? thanks!