[GH-ISSUE #915] Question: Merge two ArchiveBox collections #2076

Closed
opened 2026-03-01 17:56:17 +03:00 by kerem · 3 comments
Owner

Originally created by @dbready on GitHub (Jan 23, 2022).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/915

First of all, thanks for this great project. It gives me real peace of mind to know I can easily refer back to a few articles forever.

I have two machines that I use regularly. Each has its own archive, built from a partly overlapping set of URLs archived at different times (theoretically, there would be no overlapping timestamps, but time is a funny thing).

Is there a built-in mechanism I can use to merge ArchiveBox A into ArchiveBox B to get the union of the content in a single repository? I could not find documentation describing this scenario, but I may have missed something obvious.

Thanks.

kerem 2026-03-01 17:56:17 +03:00
Author
Owner

@pirate commented on GitHub (Jan 24, 2022):

Yes, it's quite easy. First, upgrade both old collections to the most recent ArchiveBox version, then create a new collection in a new folder somewhere. Drag everything under `archive/*` in both old collections into the new collection's `archive/` folder, then run `archivebox init` in the new collection to generate the index.
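
For anyone who would rather script the folder step than drag files by hand, here is a minimal sketch of the procedure described above. It assumes both old collections have already been upgraded, and it only uses the `archive/` folder layout and the `archivebox init` command mentioned in this comment. The function names `merge_archive_dirs` and `rebuild_index` and the example paths are hypothetical, and skipping entries whose folder name already exists in the destination is an assumption about how to treat overlapping timestamps, not documented ArchiveBox behavior.

```python
import shutil
import subprocess
from pathlib import Path


def merge_archive_dirs(old_collections, new_collection):
    """Copy every entry under each old collection's archive/ folder
    into new_collection/archive/.

    Entries whose name already exists in the destination are left
    untouched and returned, so they can be reviewed by hand.
    (Assumption: a name collision means the same snapshot timestamp.)
    """
    dest_archive = Path(new_collection) / "archive"
    dest_archive.mkdir(parents=True, exist_ok=True)

    skipped = []
    for collection in old_collections:
        src_archive = Path(collection) / "archive"
        for entry in sorted(src_archive.iterdir()):
            target = dest_archive / entry.name
            if target.exists():
                # Never overwrite: first collection listed wins.
                skipped.append(entry)
                continue
            if entry.is_dir():
                shutil.copytree(entry, target)
            else:
                shutil.copy2(entry, target)
    return skipped


def rebuild_index(new_collection):
    """Run `archivebox init` inside the merged collection folder."""
    subprocess.run(["archivebox", "init"], cwd=new_collection, check=True)


# Example usage (hypothetical paths):
#   skipped = merge_archive_dirs(["/data/box_a", "/data/box_b"], "/data/merged")
#   for path in skipped:
#       print("already present, skipped:", path)
#   rebuild_index("/data/merged")
```

The sketch copies rather than moves, so both original collections stay intact until the merged index has been checked.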

Author
Owner

@prnake commented on GitHub (Feb 9, 2022):

> Yes, it's quite easy. First, upgrade both old collections to the most recent ArchiveBox version, then create a new collection in a new folder somewhere. Drag everything under `archive/*` in both old collections into the new collection's `archive/` folder, then run `archivebox init` in the new collection to generate the index.

It took me quite a while to figure out how to rebuild the database from existing archives. I think this method should be written into the README, since the documentation currently does not seem to mention the correct import method anywhere.

Author
Owner

@pirate commented on GitHub (Mar 16, 2022):

Added a wiki page: https://github.com/ArchiveBox/ArchiveBox/wiki/Upgrading---Merging-Archives
