[PR #520] [CLOSED] Split Snapshot into Link & Snapshot + migrate #2714

Closed
opened 2026-03-01 18:00:32 +03:00 by kerem · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/520
Author: @mAAdhaTTah
Created: 10/30/2020
Status: Closed

Base: master ← Head: split-snapshot-with-link


📝 Commits (1)

  • 429f1de Split Snapshot into Link & Snapshot + migrate

📊 Changes

3 files changed (+96 additions, -10 deletions)


📝 archivebox/core/admin.py (+2 -2)
➕ archivebox/core/migrations/0007_auto_20201030_1705.py (+60 -0)
📝 archivebox/core/models.py (+34 -8)

📄 Description

Proof of concept.

Summary

Poking around some ideas to improve the data model in ArchiveBox. This splits the Snapshot model into separate Snapshot and Link models, with a Link representing a single URL added to ArchiveBox, and a Snapshot representing one download of that Link at a given time.
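To make the proposed relationship concrete, here is a rough sketch of the one-Link-to-many-Snapshots shape, along with the kind of grouping a data migration would need to do. It uses plain dataclasses rather than Django models, and every name in it (`url`, `timestamp`, `snapshots`, `split_rows`) is an assumption for illustration, not taken from the PR's diff.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Dict, List


@dataclass
class Link:
    """One URL added to the archive (hypothetical fields)."""
    url: str
    # Excluded from repr/compare to avoid recursing through the back-reference.
    snapshots: List["Snapshot"] = field(default_factory=list, repr=False, compare=False)


@dataclass
class Snapshot:
    """One download of a Link at a given time (hypothetical fields)."""
    link: Link
    timestamp: datetime


def split_rows(rows: List[dict]) -> Dict[str, Link]:
    """Group flat pre-split rows (url + timestamp) into one Link per URL."""
    links: Dict[str, Link] = {}
    for row in rows:
        # Reuse the existing Link for a URL, or create it on first sight.
        link = links.setdefault(row["url"], Link(url=row["url"]))
        link.snapshots.append(Snapshot(link=link, timestamp=row["timestamp"]))
    return links


# Example: two downloads of one URL collapse into a single Link.
rows = [
    {"url": "https://example.com", "timestamp": datetime(2020, 10, 30, 17, 5)},
    {"url": "https://example.com", "timestamp": datetime(2020, 10, 31, 9, 0)},
    {"url": "https://example.org", "timestamp": datetime(2020, 10, 30, 18, 0)},
]
links = split_rows(rows)
```

In a real Django version the back-reference would presumably be a `ForeignKey` from Snapshot to Link, and the grouping would live in a data migration; that is a guess at the shape, not a description of the actual commit.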

There's definitely more work to be done with this (including testing against more than just one URL + tags), but this seems like one of several necessary steps to move data that's currently on the filesystem into the database (similar to #513).

How does this approach look? Thoughts?

Related issues

Conversation in #496.

Changes these areas

  • [ ] Bugfixes
  • [x] Feature behavior
  • [ ] Command line interface
  • [ ] Configuration options
  • [ ] Internal architecture
  • [ ] Snapshot data layout on disk

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.
