[GH-ISSUE #826] How can I change the default archive methods on web frontend / CLI when I select nothing? #2021

Open
opened 2026-03-01 17:55:53 +03:00 by kerem · 3 comments

Originally created by @WantToLearnJapanese on GitHub (Aug 13, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/826

Current behavior: archive methods (select at least 1, otherwise all will be used by default)
Expected behavior: configurable default methods that are used when I select nothing.

I know I can opt out of individual methods by setting SAVE_* to False.
=> But that disables the functionality entirely: if I set SAVE_PDF=False, ArchiveBox does nothing for PDF even when I explicitly select pdf.


@WantToLearnJapanese commented on GitHub (Aug 13, 2021):

  • I propose adding a config option that sets a list of default methods, which could be passed to the "extractors" argument of this [function](https://github.com/ArchiveBox/ArchiveBox/blob/f6aad7f9a35d6ba876235b8a23911df2265736d9/archivebox/main.py#L554).

@pirate commented on GitHub (Jan 19, 2024):

Can't promise I'll get around to adding this myself anytime soon, but I'd be down to review PRs to add a `DEFAULT_EXTRACTORS` option that takes a CSV of extractor names to enable by default.

It should modify the `archivebox add`, `archivebox schedule`, and `/add/` interfaces to use that set by default, but still allow overriding with `--extract=somedisabledextractor,etc` or by choosing in the UI list.
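
A minimal sketch of the selection logic described above, not ArchiveBox's actual code. `DEFAULT_EXTRACTORS` is the option name proposed in this comment. The helper names `parse_csv` and `resolve_extractors` and the `ALL_EXTRACTORS` list are hypothetical and only serve the illustration. The sketch does not model how the existing `SAVE_*` switches would interact with the new option.

```python
from typing import List, Optional, Sequence

# Hypothetical extractor names, for illustration only.
ALL_EXTRACTORS = ["title", "favicon", "wget", "singlefile", "pdf", "screenshot", "dom", "media"]


def parse_csv(value: Optional[str]) -> List[str]:
    """Split a comma-separated string into stripped, non-empty names."""
    if not value:
        return []
    return [name.strip() for name in value.split(",") if name.strip()]


def resolve_extractors(
    selected: Optional[str],
    default_extractors: Optional[str],
    all_extractors: Sequence[str] = ALL_EXTRACTORS,
) -> List[str]:
    """Pick the extractors to run for one add/schedule request.

    Precedence (an assumption based on the comment above):
    1. an explicit selection (CLI flag or UI list) always wins,
    2. otherwise the configured DEFAULT_EXTRACTORS value is used,
    3. otherwise every known extractor runs, as it does today.
    """
    chosen = parse_csv(selected) or parse_csv(default_extractors) or list(all_extractors)
    unknown = [name for name in chosen if name not in all_extractors]
    if unknown:
        raise ValueError(f"Unknown extractor(s): {', '.join(unknown)}")
    return chosen


if __name__ == "__main__":
    # Nothing selected: fall back to the configured defaults.
    print(resolve_extractors("", "pdf,screenshot"))
    # Explicit selection overrides the defaults, even for a non-default extractor.
    print(resolve_extractors("wget", "pdf,screenshot"))
```

Keeping the explicit selection at the top of the precedence order is what lets a user run an extractor that is left out of the defaults, which is the case the original report says `SAVE_*=False` cannot handle.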


@pirate commented on GitHub (Jan 19, 2024):

Related: https://github.com/ArchiveBox/ArchiveBox/issues/881
