[GH-ISSUE #1291] Feature Request: New extractor to download javascript/ts sourcemap files for any compiled/minified .js assets used in archived pages #2305

Open
opened 2026-03-01 17:58:03 +03:00 by kerem · 3 comments
Owner

Originally created by @jensolsson on GitHub (Dec 17, 2023).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1291

I love how ArchiveBox is downloading javascript files from a specific site, however to make minified javascript readable it would be great to also include the .map file (called javascript source maps. If I understand correctly they are the same name but ends in .js.map instead of .js). Can this be added easily ?

If someone would give me some pointers on where to start I could probably do the work

Originally created by @jensolsson on GitHub (Dec 17, 2023). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1291 I love how ArchiveBox is downloading javascript files from a specific site, however to make minified javascript readable it would be great to also include the .map file (called javascript source maps. If I understand correctly they are the same name but ends in .js.map instead of .js). Can this be added easily ? If someone would give me some pointers on where to start I could probably do the work
Author
Owner

@pirate commented on GitHub (Dec 17, 2023):

This would likely require a new extractor, since none of our existing extractors try to download or are even aware of .map files.

You can check out our docs on that process here: https://github.com/ArchiveBox/ArchiveBox#contributing-a-new-extractor

Though to be honest this one would be fairly low on my priority list as it's just for developer experience and doesn't visually impact replay fidelity.

If you're interested in contributing an extractor, we'd love to have help adding one of these higher-impact ones, and I'm offering $250~$1000+ bounties for contributions at the moment:

  • an extractor to save discussion threads from forums, comment sections, reddit, twitter, etc as markdown/json
  • an extractor to fetch galleries of images e.g. gallery-dl
  • an extractor to fetch linked 3D assets like meshes, shaders, STL files, and other CAD models
  • an extractor to fetch research papers referenced by DOI numbers from scihub/jstor/etc
<!-- gh-comment-id:1859307619 --> @pirate commented on GitHub (Dec 17, 2023): This would likely require a new extractor, since none of our existing extractors try to download or are even aware of `.map` files. You can check out our docs on that process here: https://github.com/ArchiveBox/ArchiveBox#contributing-a-new-extractor Though to be honest this one would be fairly low on my priority list as it's just for developer experience and doesn't visually impact replay fidelity. If you're interested in contributing an extractor, we'd love to have help adding one of these higher-impact ones, and I'm offering $250~$1000+ bounties for contributions at the moment: - an extractor to save discussion threads from forums, comment sections, reddit, twitter, etc as markdown/json - an extractor to fetch galleries of images e.g. `gallery-dl` - an extractor to fetch linked 3D assets like meshes, shaders, STL files, and other CAD models - an extractor to fetch research papers referenced by DOI numbers from scihub/jstor/etc
Author
Owner

@Myestery commented on GitHub (Oct 24, 2024):

Hello @pirate, if this bounty is still up I'll like to work on it

<!-- gh-comment-id:2436413403 --> @Myestery commented on GitHub (Oct 24, 2024): Hello @pirate, if this bounty is still up I'll like to work on it
Author
Owner

@pirate commented on GitHub (Oct 24, 2024):

The bounty is unfortunatley no longer available, but once the plugin ecosystem is released in v0.9.0 https://docs.sweeting.me/s/archivebox-plugin-ecosystem-announcement it should be much easier to contribute new extractors, please check back in a bit!

<!-- gh-comment-id:2436442417 --> @pirate commented on GitHub (Oct 24, 2024): The bounty is unfortunatley no longer available, but once the plugin ecosystem is released in v0.9.0 https://docs.sweeting.me/s/archivebox-plugin-ecosystem-announcement it should be much easier to contribute new extractors, please check back in a bit!
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#2305
No description provided.