[PR #1201] [CLOSED] Remove/detect empty media extractor directory #1347

Closed
opened 2026-03-01 14:49:25 +03:00 by kerem · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/1201
Author: @overhacked
Created: 8/4/2023
Status: Closed

Base: devHead: retry_empty_media


📝 Commits (2)

  • 0d21f24 Remove/detect empty media extractor directory
  • 6224794 Media directory is empty if only contains dotfiles

📊 Changes

1 file changed (+9 additions, -1 deletions)

View changed files

📝 archivebox/extractors/media.py (+9 -1)

📄 Description

If the media extraction fails, try to remove an empty media directory,
ignoring any errors. When retrying extractors (e.g. --update), detect
an empty media directory and run the extractor; recovers from failed
media extractions prior to this commit.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ArchiveBox/ArchiveBox/pull/1201 **Author:** [@overhacked](https://github.com/overhacked) **Created:** 8/4/2023 **Status:** ❌ Closed **Base:** `dev` ← **Head:** `retry_empty_media` --- ### 📝 Commits (2) - [`0d21f24`](https://github.com/ArchiveBox/ArchiveBox/commit/0d21f24f5d796722b39a76cb39cf337305ea0d0a) Remove/detect empty media extractor directory - [`6224794`](https://github.com/ArchiveBox/ArchiveBox/commit/622479436be1a56c3647fdf55bd28c09020dd334) Media directory is empty if only contains dotfiles ### 📊 Changes **1 file changed** (+9 additions, -1 deletions) <details> <summary>View changed files</summary> 📝 `archivebox/extractors/media.py` (+9 -1) </details> ### 📄 Description If the media extraction fails, try to remove an empty media directory, ignoring any errors. When retrying extractors (e.g. `--update`), detect an empty media directory and run the extractor; recovers from failed media extractions prior to this commit. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
kerem 2026-03-01 14:49:25 +03:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#1347
No description provided.