mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 17:16:00 +03:00
[GH-ISSUE #342] Question: ...How to merge multiple Archive collections #3267
Originally created by @ekiel on GitHub (Apr 29, 2020).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/342
Right now I’m using ArchiveBox for multiple collections on different machines, and their contents are not the same. Is it possible to merge the ArchiveBox outputs without having to re-scrape everything?
@pirate commented on GitHub (Apr 29, 2020):
Yup, this is supported natively with the new `archivebox init` feature in `>=v0.4`. You can drag all the timestamp folders from one archive folder to the other, run `init`, and it will import them all, effectively merging the two archives.

The new version also doesn't require keeping the output folder in the same folder as the code. Instead, you `pip` install the `archivebox` command system-wide (or in a virtualenv), and then you can run `archivebox init` in any folder to use it as a data folder.

You can install from the `django` branch to try out the early pre-release version, or subscribe to this PR to get notified when it actually gets merged: https://github.com/pirate/ArchiveBox/pull/207 (it may take a while, I rarely have coding time to dedicate to this project these days).
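
For anyone who would rather script the "drag the timestamp folders over" step than do it by hand, here is a minimal Python sketch. It is not part of ArchiveBox: the function name `merge_timestamp_folders` is hypothetical, and it assumes the timestamp folders sit directly inside the two directories you pass in. Skipping folders whose name already exists in the destination is also an assumption, chosen so nothing gets overwritten.

```python
import shutil
from pathlib import Path


def merge_timestamp_folders(src_archive, dst_archive):
    """Copy snapshot folders from src_archive into dst_archive.

    Hypothetical helper, not an ArchiveBox API. Returns (copied, skipped)
    lists of folder names. A folder whose name already exists in the
    destination is skipped rather than overwritten.
    """
    src = Path(src_archive)
    dst = Path(dst_archive)
    dst.mkdir(parents=True, exist_ok=True)

    copied, skipped = [], []
    # Only directories are considered; loose files in src are ignored.
    for folder in sorted(p for p in src.iterdir() if p.is_dir()):
        target = dst / folder.name
        if target.exists():
            skipped.append(folder.name)
            continue
        shutil.copytree(folder, target)
        copied.append(folder.name)
    return copied, skipped
```

After the copy, running `archivebox init` in the destination data folder should pick up the new folders, as described above. Anything reported in `skipped` exists in both collections and would need to be reconciled by hand.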