[GH-ISSUE #647] Feature Request: search by status in web ui #3425

Closed
opened 2026-03-14 22:47:13 +03:00 by kerem · 1 comment
Owner

Originally created by @khimaros on GitHub (Feb 6, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/647

Type

  • General question or discussion
  • Propose a brand new feature
  • Request modification of existing behavior or design

What is the problem that your feature request solves

i have an enormous number of links to archive and would like to be able to search for the ones which have already been archived in the web ui.

Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes

typing status:valid or status:archived in the ui would display the relevant snapshots only.

What hacks or alternative solutions have you tried to solve the problem?

this can be accomplished on the command line:

$ archive box list --status archived

How badly do you want this new feature?

  • It's an urgent deal-breaker, I can't live without it
  • It's important to add it in the near-mid term future
  • It would be nice to have eventually

  • I'm willing to contribute dev time / money to fix this issue
  • I like ArchiveBox so far / would recommend it to a friend
  • I've had a lot of difficulty getting ArchiveBox set up
Originally created by @khimaros on GitHub (Feb 6, 2021). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/647 <!-- Please fill out the following information, feel free to delete sections if they're not applicable or if long issue templates annoy you :) --> ## Type - [ ] General question or discussion - [x] Propose a brand new feature - [ ] Request modification of existing behavior or design ## What is the problem that your feature request solves <!-- e.g. I need to be able to archive spanish and french subtitle files from a particular <example.com> movie site that's going down soon. --> i have an enormous number of links to archive and would like to be able to search for the ones which have already been archived in the web ui. ## Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes <!-- e.g. I specifically need a new archive method to look for multilingual subtitle files related to pages. The bigger picture solution is the ability for custom user scripts to be run in a puppeteer context during archiving. --> typing `status:valid` or `status:archived` in the ui would display the relevant snapshots only. ## What hacks or alternative solutions have you tried to solve the problem? <!-- A clear and concise description of any alternative solutions, workarounds, or other software you've considered using to fix the problem. --> this can be accomplished on the command line: ``` $ archive box list --status archived ``` ## How badly do you want this new feature? - [ ] It's an urgent deal-breaker, I can't live without it - [ ] It's important to add it in the near-mid term future - [x] It would be nice to have eventually --- - [x] I'm willing to contribute [dev time](https://github.com/ArchiveBox/ArchiveBox#archivebox-development) / [money](https://github.com/sponsors/pirate) to fix this issue - [x] I like ArchiveBox so far / would recommend it to a friend - [ ] I've had a lot of difficulty getting ArchiveBox set up
Author
Owner

@pirate commented on GitHub (Apr 6, 2021):

I've sort of implemented this in c98053a. It's not exactly what you're asking for since it's not technically the Snapshot status, it's the ArchiveResult status (aka the individual extractor outputs under a Snapshot). It would be tricky to allow filtering by Snapshot status directly since theres actually no db field for it, it's a derived property based on some heuristics and cant be queried directly.

It should be out with the next release v0.6 soon. You can also try it early by installing directly from the dev branch.

pip install "git+https://github.com/ArchiveBox/ArchiveBox.git@dev"

Comment back here if that solution doesn't work / you absolutely need Snapshot status as originally described and we can discuss further.

<!-- gh-comment-id:813851118 --> @pirate commented on GitHub (Apr 6, 2021): I've sort of implemented this in c98053a. It's not exactly what you're asking for since it's not technically the Snapshot status, it's the ArchiveResult status (aka the individual extractor outputs under a Snapshot). It would be tricky to allow filtering by Snapshot status directly since theres actually no db field for it, it's a derived property based on some heuristics and cant be queried directly. It should be out with the next release v0.6 soon. You can also try it early by installing directly from the dev branch. ```bash pip install "git+https://github.com/ArchiveBox/ArchiveBox.git@dev" ``` Comment back here if that solution doesn't work / you absolutely need Snapshot status as originally described and we can discuss further.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#3425
No description provided.