[GH-ISSUE #1013] Question: Small backup just of URLs / Index #2145

Closed
opened 2026-03-01 17:56:51 +03:00 by kerem · 1 comment

Originally created by @turian on GitHub (Aug 12, 2022).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1013

I want to create a small backup just of URLs, time added, and maybe other index / metadata.

I don't want to back up the whole media archives, HTML, etc.

What file(s) should I back up?

ArchiveBox.conf
index.sqlite3
index.sqlite3-shm
index.sqlite3-wal

Is that correct?

kerem closed this issue 2026-03-01 17:56:51 +03:00

@pirate commented on GitHub (Aug 20, 2022):

Yes, those alone are fine. If you want, you can also save more detailed per-snapshot metadata by backing up `archive/*/index.json`.

You can even use the shallow backup normally with ArchiveBox by running `archivebox init` in the backup directory. It will let you browse the entries in the UI, but viewing actual snapshot content will show 404s.
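
For anyone who wants to script this, here is a minimal sketch of such a shallow backup in Python. It is not an ArchiveBox feature: the function name `shallow_backup` and both directory arguments are hypothetical, and only the file names come from this thread. One deliberate substitution: instead of copying `index.sqlite3`, `index.sqlite3-shm`, and `index.sqlite3-wal` as three separate files, it uses SQLite's online backup API (`sqlite3.Connection.backup`) to write a single consistent `index.sqlite3`, which should be safer if the database might be in use during the copy.

```python
import shutil
import sqlite3
from pathlib import Path


def shallow_backup(data_dir: Path, backup_dir: Path) -> list[Path]:
    """Copy only the config, index database, and per-snapshot index.json files.

    Hypothetical helper: data_dir is the ArchiveBox data folder,
    backup_dir is wherever the small backup should go.
    """
    backup_dir.mkdir(parents=True, exist_ok=True)
    copied: list[Path] = []

    # Config file named in the question; skipped if it does not exist.
    conf = data_dir / "ArchiveBox.conf"
    if conf.is_file():
        shutil.copy2(conf, backup_dir / conf.name)
        copied.append(backup_dir / conf.name)

    # Assumption: use SQLite's online backup API rather than copying the
    # .sqlite3/-shm/-wal files one by one, so the result is one consistent file.
    db = data_dir / "index.sqlite3"
    if db.is_file():
        src = sqlite3.connect(db)
        dst = sqlite3.connect(backup_dir / db.name)
        try:
            src.backup(dst)
        finally:
            dst.close()
            src.close()
        copied.append(backup_dir / db.name)

    # Optional, more detailed per-snapshot metadata: archive/*/index.json
    for index_json in sorted(data_dir.glob("archive/*/index.json")):
        target = backup_dir / index_json.relative_to(data_dir)
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(index_json, target)
        copied.append(target)

    return copied
```

Calling `shallow_backup(Path("/path/to/data"), Path("/path/to/backup"))` (both paths are placeholders) leaves a directory in which `archivebox init` can then be run as described above.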
