[GH-ISSUE #212] Generate plain-text output file #143

Closed
opened 2026-03-01 14:41:00 +03:00 by kerem · 1 comment
Owner

Originally created by @lifelike on GitHub (Apr 9, 2019).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/212

To get an maximum future-safe version of each saved page that has very limited dependencies on software to view it would be great if it could be saved as text as well (e.g. output.txt).

The best way I can think of doing this is actually to just run output.html through lynx or w3m. An output method that does something like "w3m -dump output.html > output.txt" would go a long way, but there might be some edge-cases I did not consider when I tested.

Originally created by @lifelike on GitHub (Apr 9, 2019). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/212 To get an maximum future-safe version of each saved page that has very limited dependencies on software to view it would be great if it could be saved as text as well (e.g. output.txt). The best way I can think of doing this is actually to just run output.html through lynx or w3m. An output method that does something like "w3m -dump output.html > output.txt" would go a long way, but there might be some edge-cases I did not consider when I tested.
kerem closed this issue 2026-03-01 14:41:00 +03:00
Author
Owner

@pirate commented on GitHub (Apr 9, 2019):

Duplicate of: https://github.com/pirate/ArchiveBox/issues/69

<!-- gh-comment-id:481294525 --> @pirate commented on GitHub (Apr 9, 2019): Duplicate of: https://github.com/pirate/ArchiveBox/issues/69
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#143
No description provided.