[GH-ISSUE #1111] Feature Request: Add support for sending custom request headers across all archiving methods #2209

Open
opened 2026-03-01 17:57:17 +03:00 by kerem · 1 comment
Owner

Originally created by @hykilpikonna on GitHub (Mar 1, 2023).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1111

The page I want to archive uses automatic language detection / i18n, and I want to archive in a language other than English. One way to achieve this is to add a custom accept-language request header. Is there an option to easily configure request headers for each archive?

Originally created by @hykilpikonna on GitHub (Mar 1, 2023). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1111 The page I want to archive uses automatic language detection / i18n, and I want to archive in a language other than English. One way to achieve this is to add a custom `accept-language` request header. Is there an option to easily configure request headers for each archive?
Author
Owner

@pirate commented on GitHub (Mar 2, 2023):

There is not a universal setting to add headers for all methods, but you can add them individually using each extractor's <extractorname>_ARGS='["--some-headers-here...", "--some-other-args=..."]' config settings.

For example to set the header for curl you'd use CURL_ARGS https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration#curl_args:

archivebox config --set CURL_ARGS='["--tlsv1.3","--http2", "--header", "accept-language: en-us en"]'

Adding support for custom headers for every extractor at once would be cool, but I'm not sure if we'll get around to adding that anytime soon.

<!-- gh-comment-id:1451143683 --> @pirate commented on GitHub (Mar 2, 2023): There is not a universal setting to add headers for all methods, but you can add them individually using each extractor's `<extractorname>_ARGS='["--some-headers-here...", "--some-other-args=..."]'` config settings. For example to set the header for curl you'd use `CURL_ARGS` https://github.com/ArchiveBox/ArchiveBox/wiki/Configuration#curl_args: ```bash archivebox config --set CURL_ARGS='["--tlsv1.3","--http2", "--header", "accept-language: en-us en"]' ``` Adding support for custom headers for every extractor at once would be cool, but I'm not sure if we'll get around to adding that anytime soon.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#2209
No description provided.