[PR #669] add command: --parser option (fixes #235) #2771

Closed
opened 2026-03-01 18:00:42 +03:00 by kerem · 0 comments
Owner

Original Pull Request: https://github.com/ArchiveBox/ArchiveBox/pull/669

State: closed
Merged: Yes


Summary

This PR adds an additional "input format" option to archivebox add. When set to a value other than the default, only that format will be parsed. If the url-list format is specified, each non-empty line in input files is simply added as a URL.

Related issues

#235

Stuff not yet done/determined

  • should the other parsers should also be available using this option?
  • documentation updates (especially the wiki section "Import a list of URLs from a text file")

Changes these areas

  • Bugfixes
  • Feature behavior
  • Command line interface
  • Configuration options
  • Internal architecture
  • Snapshot data layout on disk
**Original Pull Request:** https://github.com/ArchiveBox/ArchiveBox/pull/669 **State:** closed **Merged:** Yes --- # Summary This PR adds an additional "input format" option to `archivebox add`. When set to a value other than the default, only that format will be parsed. If the `url-list` format is specified, each non-empty line in input files is simply added as a URL. # Related issues #235 # Stuff not yet done/determined - [ ] should the other parsers should also be available using this option? - [ ] documentation updates (especially the wiki section "Import a list of URLs from a text file") # Changes these areas - [ ] Bugfixes - [x] Feature behavior - [x] Command line interface - [ ] Configuration options - [ ] Internal architecture - [ ] Snapshot data layout on disk
kerem 2026-03-01 18:00:42 +03:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#2771
No description provided.