[GH-ISSUE #641] Feature Request: Save site-wide Tag sort order based on admin Tag list last sorted column #3417

Closed
opened 2026-03-14 22:46:00 +03:00 by kerem · 5 comments
Owner

Originally created by @Wanty5883 on GitHub (Feb 1, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/641

Type

  • General question or discussion
  • Propose a brand new feature
  • Request modification of existing behavior or design

What is the problem that your feature request solves

Not being able to sort the tags list in other than the tag page. Which is a significant problem as the tag list scale up

Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes

From the tag panel we able to sort the tags by name, ID or slug, this should change the way tags are sorted in the other pages of ArchiveBox.

How badly do you want this new feature?

  • It's an urgent deal-breaker, I can't live without it
  • It's important to add it in the near-mid term future
  • It would be nice to have eventually

  • I'm willing to contribute dev time / money to fix this issue
  • I like ArchiveBox so far / would recommend it to a friend
  • I've had a lot of difficulty getting ArchiveBox set up
Originally created by @Wanty5883 on GitHub (Feb 1, 2021). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/641 <!-- Please fill out the following information, feel free to delete sections if they're not applicable or if long issue templates annoy you :) --> ## Type - [ ] General question or discussion - [ ] Propose a brand new feature - [x] Request modification of existing behavior or design ## What is the problem that your feature request solves <!-- e.g. I need to be able to archive spanish and french subtitle files from a particular <example.com> movie site that's going down soon. --> Not being able to sort the tags list in other than the tag page. Which is a significant problem as the tag list scale up ## Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes <!-- e.g. I specifically need a new archive method to look for multilingual subtitle files related to pages. The bigger picture solution is the ability for custom user scripts to be run in a puppeteer context during archiving. --> From the tag panel we able to sort the tags by name, ID or slug, this should change the way tags are sorted in the other pages of ArchiveBox. ## How badly do you want this new feature? - [ ] It's an urgent deal-breaker, I can't live without it - [x] It's important to add it in the near-mid term future - [ ] It would be nice to have eventually --- - [ ] I'm willing to contribute [dev time](https://github.com/ArchiveBox/ArchiveBox#archivebox-development) / [money](https://github.com/sponsors/pirate) to fix this issue - [x] I like ArchiveBox so far / would recommend it to a friend - [ ] I've had a lot of difficulty getting ArchiveBox set up
Author
Owner

@Wanty5883 commented on GitHub (Feb 1, 2021):

I'll use this ticket to ask a question so I don't have to open a second one.

I am using ArchiveBox in a docker container. It is really slow for me, not archiving a web page, but just the browsing of ArchiveBox itself, changing tags, going to another page, etc.
Did I miss something obvious ? How can I diagnose this issue so I can learn how to fix it ?

<!-- gh-comment-id:770850052 --> @Wanty5883 commented on GitHub (Feb 1, 2021): I'll use this ticket to ask a question so I don't have to open a second one. I am using ArchiveBox in a docker container. It is really slow for me, not archiving a web page, but just the browsing of ArchiveBox itself, changing tags, going to another page, etc. Did I miss something obvious ? How can I diagnose this issue so I can learn how to fix it ?
Author
Owner

@pirate commented on GitHub (Feb 1, 2021):

v0.5 is quite slow due to some architectural issues, we're going to improve the speed in v0.6 significantly hopefully.

Make sure you're running it on an SSD, it relies heavily on fast disk access more than CPU or RAM.

<!-- gh-comment-id:771084884 --> @pirate commented on GitHub (Feb 1, 2021): v0.5 is quite slow due to some architectural issues, we're going to improve the speed in v0.6 significantly hopefully. Make sure you're running it on an SSD, it relies heavily on fast disk access more than CPU or RAM.
Author
Owner

@Wanty5883 commented on GitHub (Feb 1, 2021):

The docker container runs on a VM but the snapshot content does store in a HDD. Basically only the /data/sources and /data/archive does point to the HDD.

What about the tag list in the webpage overall ? Planning to change that as well ? Having a lot of tags and not being able to sort them in different ways is kinda hard as it does scale up

<!-- gh-comment-id:771234375 --> @Wanty5883 commented on GitHub (Feb 1, 2021): The docker container runs on a VM but the snapshot content does store in a HDD. Basically only the `/data/sources` and `/data/archive` does point to the HDD. What about the tag list in the webpage overall ? Planning to change that as well ? Having a lot of tags and not being able to sort them in different ways is kinda hard as it does scale up
Author
Owner

@pirate commented on GitHub (Feb 2, 2021):

Yeah, improving tagging + organization in general is definitely on the roadmap, but to be perfectly honest it's 2nd priority compared to performance and stability.

I think we have a long way to go with performance, and there's some deep refactoring needed to go from the old "run a bunch of bash commands sequentially and store the results in text files" approach to the new "manage a pool of async headless browser workers that read tasks off a message queue" design.

If people want to contribute tagging improvements to the UI or data model, I'm happy to discuss potential features + implementations, but in the meantime I recommend people use other curation/organization tools for their links, and use archivebox as the data storage layer. e.g. with a Pocket/Pinboard free account, or a self-hosted equivalent like Wallabag.

Re: your immediate performance needs @Wanty5883: how many URLs are in your archive, and how big are the index.sqlite3 and /data/archive folders?

<!-- gh-comment-id:771316992 --> @pirate commented on GitHub (Feb 2, 2021): Yeah, improving tagging + organization in general is definitely on the roadmap, but to be perfectly honest it's 2nd priority compared to performance and stability. I think we have a long way to go with performance, and there's some deep refactoring needed to go from the old "run a bunch of bash commands sequentially and store the results in text files" approach to the new "manage a pool of async headless browser workers that read tasks off a message queue" design. If people want to contribute tagging improvements to the UI or data model, I'm happy to discuss potential features + implementations, but in the meantime I recommend people use other curation/organization tools for their links, and use archivebox as the data storage layer. e.g. with a Pocket/Pinboard free account, or a self-hosted equivalent like Wallabag. Re: your immediate performance needs @Wanty5883: how many URLs are in your archive, and how big are the `index.sqlite3` and `/data/archive` folders?
Author
Owner

@pirate commented on GitHub (Apr 6, 2021):

BTW performance is significantly improved in v0.6 now, should be much faster.

You can filter Snapshots by tag in the latest version using the panel on the right, does that cover your needs? You can also use the search box.

image

I don't think it's worth the complexity of the extra code to globally sort the tags based on the sort options chosen on the Tags page, I'd rather just make the existing tag filtering and curation options easier so that it's not as necessary. Thanks for the suggestion though, let me know if v0.6 improves the situation for you enough, I added a lot of tag improvements.

<!-- gh-comment-id:813857012 --> @pirate commented on GitHub (Apr 6, 2021): BTW performance is significantly improved in v0.6 now, should be much faster. You can filter Snapshots by tag in the latest version using the panel on the right, does that cover your needs? You can also use the search box. ![image](https://user-images.githubusercontent.com/511499/113667421-caf48400-967e-11eb-8804-3fc2643505ae.png) I don't think it's worth the complexity of the extra code to globally sort the tags based on the sort options chosen on the Tags page, I'd rather just make the existing tag filtering and curation options easier so that it's not as necessary. Thanks for the suggestion though, let me know if v0.6 improves the situation for you enough, I added a lot of tag improvements.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#3417
No description provided.