[GH-ISSUE #564] New Extractor Idea: gallery-dl for image gallery downloading #3378

Open
opened 2026-03-14 22:30:39 +03:00 by kerem · 5 comments

Originally created by @pirate on GitHub (Dec 2, 2020).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/564

Add a new extractor for image gallery photo downloading:

https://github.com/mikf/gallery-dl

PRs are welcome for this one! But please discuss your approach here first before starting any work!
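
As a starting point for that discussion, here is a minimal sketch of what the core of such an extractor might look like: shell out to the `gallery-dl` binary and report success or failure. The helper names (`build_gallerydl_cmd`, `save_gallery`), the 120-second default timeout, and the injectable `runner` argument are hypothetical and not part of ArchiveBox; the only `gallery-dl` option used is `-d`, which sets the download destination.

```python
import subprocess
from pathlib import Path
from typing import Callable, List


def build_gallerydl_cmd(url: str, out_dir: Path, binary: str = "gallery-dl") -> List[str]:
    """Build the argv for one gallery-dl run (hypothetical helper)."""
    # -d tells gallery-dl where to place downloaded files
    return [binary, "-d", str(out_dir), url]


def save_gallery(
    url: str,
    out_dir: Path,
    timeout: int = 120,  # assumed default, not an ArchiveBox setting
    runner: Callable[..., subprocess.CompletedProcess] = subprocess.run,
) -> bool:
    """Download a gallery into out_dir; return True if gallery-dl exited with 0."""
    out_dir.mkdir(parents=True, exist_ok=True)
    cmd = build_gallerydl_cmd(url, out_dir)
    try:
        result = runner(cmd, capture_output=True, text=True, timeout=timeout)
    except (FileNotFoundError, subprocess.TimeoutExpired):
        # Binary not installed, or the download took too long
        return False
    return result.returncode == 0
```

Passing the runner in as an argument keeps the sketch testable without network access or an installed `gallery-dl`; a real extractor would also need to record the output files and hook into the existing config and index code.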


@pirate commented on GitHub (Jan 12, 2024):

@benmuth this is actually higher priority than the upgrade banner stuff, can you switch to working on this when you get a chance?

ArchiveBox's niche is being able to download a wide array of content into its raw constituent parts, and we've been needing a photo gallery downloading ability for a while!


Because this is a larger ticket, I'm down to pay out more. I expect it to take a bit of work to prototype and test, since it requires touching a lot of different parts of the codebase.

The process to add a new extractor is involved, but fairly well documented here: https://github.com/ArchiveBox/ArchiveBox#contributing-a-new-extractor

Let me know if you have any questions!


@benmuth commented on GitHub (Jan 20, 2024):

@pirate Should the `media` extractor also handle `gallery-dl`? I'm not sure if the `media` extractor is just an alias for `youtube-dl` or if it's supposed to handle other visual media as well.

Related: should `gallery-dl` config be controlled by the `media` options similar to the `youtube-dl` config? For example, should the default value for `USE_GALLERYDL` be determined by `SAVE_MEDIA`?
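
To make the second question concrete, here is a rough sketch of how such a default could be wired. It is only an illustration, not ArchiveBox's actual config code: `parse_bool` and `resolve_media_flags` are hypothetical helpers, and the sketch assumes `SAVE_MEDIA` itself defaults to on.

```python
from typing import Dict, Mapping, Optional


def parse_bool(value: Optional[str], default: bool) -> bool:
    """Interpret an env-style string as a boolean, falling back to default if unset."""
    if value is None:
        return default
    return value.strip().lower() in ("1", "true", "yes", "on")


def resolve_media_flags(env: Mapping[str, str]) -> Dict[str, bool]:
    """Resolve SAVE_MEDIA first, then let USE_GALLERYDL inherit it unless set explicitly."""
    # Assumption: SAVE_MEDIA is enabled when the user has not set it
    save_media = parse_bool(env.get("SAVE_MEDIA"), default=True)
    # USE_GALLERYDL follows SAVE_MEDIA by default, but an explicit value wins
    use_gallerydl = parse_bool(env.get("USE_GALLERYDL"), default=save_media)
    return {"SAVE_MEDIA": save_media, "USE_GALLERYDL": use_gallerydl}
```

With this wiring, setting `SAVE_MEDIA=False` alone would turn off `gallery-dl` too, while setting `USE_GALLERYDL=True` explicitly would keep it on regardless.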


@pirate commented on GitHub (Jan 20, 2024):

Check out my WIP PR for the new plugin system. I added a placeholder for gallerydl, but it's not fleshed out yet.


@benmuth commented on GitHub (Jan 21, 2024):

Ah cool! Should I still keep working on a `gallery-dl` branch based off `dev`, or should I wait until the plugin system has merged?


@pirate commented on GitHub (Jan 23, 2024):

Wait until the plugin system has merged, @benmuth, thanks. Sorry to interrupt your work.
