[PR #337] [MERGED] New feature: change user-agent for curl. #4126

Closed
opened 2026-03-15 01:27:43 +03:00 by kerem · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/337
Author: @comsomisha
Created: 4/14/2020
Status: Merged
Merged: 4/15/2020
Merged by: @pirate

Base: masterHead: master


📝 Commits (3)

📊 Changes

3 files changed (+11 additions, -3 deletions)

View changed files

📝 archivebox/archive_methods.py (+4 -2)
📝 archivebox/config.py (+5 -1)
📝 etc/ArchiveBox.conf.default (+2 -0)

📄 Description

Summary
Sometimes you need to change the user agent to save desktop pages in web.archive.org.
For example:
A) curl, default user-agent
https://web.archive.org/web/20200203111954/https://www.sports.ru/tribuna/blogs/puncher/2216091.html
Result: redirect to mobile version
B) curl, curl_user_agent="Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/73.0.3683.75 Safari/537.36"
https://web.archive.org/web/20200309090029/https://www.sports.ru/tribuna/blogs/puncher/2216091.html
Result: desktop version

Changes these areas

  • Bugfixes
  • Feature behavior
  • Command line interface
  • Configuration options
  • Internal architecture
  • Archived data layout on disk

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ArchiveBox/ArchiveBox/pull/337 **Author:** [@comsomisha](https://github.com/comsomisha) **Created:** 4/14/2020 **Status:** ✅ Merged **Merged:** 4/15/2020 **Merged by:** [@pirate](https://github.com/pirate) **Base:** `master` ← **Head:** `master` --- ### 📝 Commits (3) - [`18f0f66`](https://github.com/ArchiveBox/ArchiveBox/commit/18f0f66f1ebaf3a71f4ab35bf88fedcb3ea57ef2) 05042020 - [`bb58053`](https://github.com/ArchiveBox/ArchiveBox/commit/bb580533f715a1b40f8534f81ba99591b3f24821) 0504202002 - [`1aa2a5b`](https://github.com/ArchiveBox/ArchiveBox/commit/1aa2a5b0697e09d20f674571c7f1695ee4c354b2) 15042020 ### 📊 Changes **3 files changed** (+11 additions, -3 deletions) <details> <summary>View changed files</summary> 📝 `archivebox/archive_methods.py` (+4 -2) 📝 `archivebox/config.py` (+5 -1) 📝 `etc/ArchiveBox.conf.default` (+2 -0) </details> ### 📄 Description Summary Sometimes you need to change the user agent to save desktop pages in web.archive.org. For example: A) curl, default user-agent https://web.archive.org/web/20200203111954/https://www.sports.ru/tribuna/blogs/puncher/2216091.html Result: redirect to mobile version B) curl, curl_user_agent="Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/73.0.3683.75 Safari/537.36" https://web.archive.org/web/20200309090029/https://www.sports.ru/tribuna/blogs/puncher/2216091.html Result: desktop version # Changes these areas - [ ] Bugfixes - [x] Feature behavior - [ ] Command line interface - [ ] Configuration options - [ ] Internal architecture - [ ] Archived data layout on disk --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
kerem 2026-03-15 01:27:43 +03:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#4126
No description provided.