[PR #1723] [MERGED] Add MAX_URL_ATTEMPTS option to ArchiveBox #4487

Closed
opened 2026-03-15 01:47:09 +03:00 by kerem · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/1723
Author: @pirate
Created: 12/29/2025
Status: Merged
Merged: 12/29/2025
Merged by: @pirate

Base: dev ← Head: claude/add-max-url-attempts-oBHCD


📝 Commits (2)

  • 88d7906 Add MAX_URL_ATTEMPTS config option to stop retries after too many failures
  • f88182d Merge remote-tracking branch 'origin/dev' into claude/add-max-url-attempts-oBHCD

📊 Changes

2 files changed (+27 additions, -2 deletions)


📝 archivebox/config/common.py (+1 -0)
📝 archivebox/core/models.py (+26 -2)

📄 Description

…lures

Adds a new MAX_URL_ATTEMPTS configuration option (default: 50) that stops retrying ArchiveResult hooks for a snapshot once that many failures have been recorded. This prevents infinite retry loops for problematic URLs.

When the limit is reached, any pending ArchiveResults for that snapshot are marked as SKIPPED with an explanatory message.
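The behavior described above can be sketched roughly as follows. This is a minimal, standalone illustration and not the code merged in `archivebox/core/models.py`: the `Status` enum, the dataclasses standing in for the Django models, the method name `enforce_attempt_limit`, and the wording of the skip message are all assumptions made for the example. Only the option name `MAX_URL_ATTEMPTS`, its default of 50, and the SKIPPED outcome come from the description.

```python
from dataclasses import dataclass, field
from enum import Enum

# Default taken from the PR description; everything else below is hypothetical.
MAX_URL_ATTEMPTS = 50


class Status(Enum):
    QUEUED = "queued"
    FAILED = "failed"
    SUCCEEDED = "succeeded"
    SKIPPED = "skipped"


@dataclass
class ArchiveResult:
    """Stand-in for one hook run against a snapshot (not the real model)."""
    hook: str
    status: Status = Status.QUEUED
    output: str = ""


@dataclass
class Snapshot:
    """Stand-in for a snapshot and its per-hook results (not the real model)."""
    url: str
    results: list[ArchiveResult] = field(default_factory=list)

    def failure_count(self) -> int:
        # Count every result that has been recorded as failed.
        return sum(1 for r in self.results if r.status is Status.FAILED)

    def enforce_attempt_limit(self, limit: int = MAX_URL_ATTEMPTS) -> bool:
        """Mark still-queued results as SKIPPED once `limit` failures exist.

        Returns True if the limit was reached, False otherwise.
        """
        if self.failure_count() < limit:
            return False
        for r in self.results:
            if r.status is Status.QUEUED:
                r.status = Status.SKIPPED
                r.output = f"Skipped: snapshot reached MAX_URL_ATTEMPTS={limit} failures"
        return True
```

With a snapshot holding three failed results and one queued result, `enforce_attempt_limit(limit=3)` would return `True` and leave the queued result marked as skipped, while a higher limit would leave it untouched.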


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.
