[GH-ISSUE #1096] Feature Request: Add/upload attachments and notes to snapshot #690

Open
opened 2026-03-01 14:45:32 +03:00 by kerem · 2 comments

Originally created by @Jwink3101 on GitHub (Feb 10, 2023).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1096

Type

  • [ ] General question or discussion
  • [X] Propose a brand new feature
  • [ ] Request modification of existing behavior or design

What is the problem that your feature request solves

Sometimes there is additional media that I capture manually or I want to add some note to remember why I saved it.

For example, I archived a NYTimes Podcast page but it didn't pick up the transcript. So I saved it manually. I would like to be able to attach that.

Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes

Ideally, I would be able to have an editable note box and the ability to add attachments. Maybe the note could be Markdown, with a way to link to or reference it too.

What hacks or alternative solutions have you tried to solve the problem?

I can upload anything into `archive/`, but that requires figuring out which snapshot directory is the right one, and future discoverability is majorly impacted.

How badly do you want this new feature?

  • [ ] It's an urgent deal-breaker, I can't live without it
  • [ ] It's important to add it in the near-mid term future
  • [X] It would be nice to have eventually

(probably somewhere between the bottom two)

  • [(probably)] I'm willing to contribute dev time / money to fix this issue
  • [X] I like ArchiveBox so far / would recommend it to a friend
  • [(some)] I've had a lot of difficulty getting ArchiveBox set up

I am pretty proficient in Python but short on time. I will poke around a bit and see what it may take.


@pirate commented on GitHub (Feb 10, 2023):

You can technically already do most of this: on the `Log` page in the UI, you can manually add a new ArchiveResult with `[Add Archive Result +]` and attach it to a Snapshot to track arbitrary attachments/notes.

The ArchiveResult.output field can point to any file you want within the snapshot directory; it can be a custom attachment or a text file with notes.

So you can upload any files you want into the snapshot data dir (you can figure out which dir by checking the snapshot's timestamp in the URL/admin UI), and as long as there is an ArchiveResult pointing to the files, they will remain tracked by ArchiveBox.

It's not an ideal UX / not as easy as just adding notes in a UI, but it's more resilient long-term as the db doesn't become a single point of failure where your notes can get lost.
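
The manual file-copy half of this workaround could be scripted roughly as follows. This is only a sketch under stated assumptions, not part of ArchiveBox: the helper names `find_snapshot_dir` and `attach_to_snapshot` and the `attachments/` subfolder are hypothetical, and the lookup assumes each `archive/<timestamp>/` directory contains an `index.json` with a `"url"` key. It uses only the Python standard library and does not create the ArchiveResult record, which would still be added through the admin UI as described above.

```python
import json
import shutil
from pathlib import Path
from typing import Optional


def find_snapshot_dir(data_dir: Path, url: str) -> Optional[Path]:
    """Return the archive/<timestamp>/ directory whose index.json records `url`.

    Assumption: every snapshot directory holds an index.json with a "url" key.
    """
    for index_file in sorted((data_dir / "archive").glob("*/index.json")):
        try:
            record = json.loads(index_file.read_text(encoding="utf-8"))
        except (OSError, json.JSONDecodeError):
            continue  # skip unreadable or malformed index files
        if isinstance(record, dict) and record.get("url") == url:
            return index_file.parent
    return None


def attach_to_snapshot(snapshot_dir: Path, attachment: Path, note: str = "") -> Path:
    """Copy `attachment` into the snapshot and optionally write a note next to it.

    Returns the copied file's path relative to the snapshot directory.
    """
    target_dir = snapshot_dir / "attachments"  # hypothetical subfolder name
    target_dir.mkdir(parents=True, exist_ok=True)
    copied = Path(shutil.copy2(attachment, target_dir / attachment.name))
    if note:
        # Keep the note as a plain file so it lives on disk, not only in the db.
        note_file = target_dir / (attachment.name + ".note.md")
        note_file.write_text(note, encoding="utf-8")
    return copied.relative_to(snapshot_dir)
```

The returned relative path is a candidate value for the ArchiveResult.output field when adding the record by hand; whether that field expects exactly this form is an assumption to verify against your ArchiveBox version.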

[Two attached images: "Screenshot 2023-02-10 at 2 03 56 PM" and "image"]

@Jwink3101 commented on GitHub (Feb 11, 2023):

That makes sense, though I think it falls very much under "What hacks or alternative solutions have you tried to solve the problem?".

I do like the idea of not relying on the database. Even if I could, through the web, upload an immutable note, that would go a long way.
