mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-26 01:26:00 +03:00
[GH-ISSUE #939] Does ArchiveBox support archiving a single page and all its related assets? #3602
Originally created by @zhiqiangxu on GitHub (Mar 2, 2022).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/939
Sometimes we want to archive a single page with all its related assets instead of the whole site. Does ArchiveBox support this feature?
@varenc commented on GitHub (Mar 3, 2022):
That's literally exactly what ArchiveBox does. It saves individual URLs, and their associated assets. It does not try to crawl entire websites. What gave you the wrong impression?
@zhiqiangxu commented on GitHub (Mar 3, 2022):
Because I saw this command:
It looks like it's going to crawl the entire website, especially the word schedule. If it only crawls a single URL, there's no need to schedule, right?
@akhilleusuggo commented on GitHub (Mar 3, 2022):
schedule is for feed/RSS pages that change over time (every=day/hour/month, etc.).
ArchiveBox will not re-download the same URL.
To re-download the same URL you need to take a snapshot.
@zhiqiangxu commented on GitHub (Mar 4, 2022):
This command fails to install:
The final output is this:
My installed Python version is 2.7. Does ArchiveBox only support Python 3.x?
@pirate commented on GitHub (Mar 5, 2022):
Yes, we dropped Python 2.7 support long ago. @zhiqiangxu
@pirate commented on GitHub (Mar 5, 2022):
Docker