[GH-ISSUE #771] Feature Request: Ability to merge multiple webpages into one snapshot #487

Closed
opened 2026-03-01 14:44:04 +03:00 by kerem · 1 comment
Owner

Originally created by @Victor239 on GitHub (Jun 14, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/771

Type

  • General question or discussion
  • Propose a brand new feature
  • Request modification of existing behavior or design

What is the problem that your feature request solves

I was viewing a XenForo forum thread and wanted to be able to archive all of it.

Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes

I don't expect ArchiveBox to support identifying the next thread page and archiving it, and being able to do this for all forum software.

Instead I'd like if I could manually supply all the thread page URLs I'm interested in, and for ArchiveBox to be able to merge these webpages into a single snapshot. If you're interested in doing a full thread download I'd be very happy with that too but I also don't want it to be a maintenance burden. The merge-all-webpages-into-a-single-snapshot functionality is sufficient for now.

What hacks or alternative solutions have you tried to solve the problem?

Archiving each webpage of a thread separately.

How badly do you want this new feature?

  • It's an urgent deal-breaker, I can't live without it
  • It's important to add it in the near-mid term future
  • It would be nice to have eventually
Originally created by @Victor239 on GitHub (Jun 14, 2021). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/771 ## Type - [ ] General question or discussion - [x] Propose a brand new feature - [ ] Request modification of existing behavior or design ## What is the problem that your feature request solves I was viewing a XenForo forum thread and wanted to be able to archive all of it. ## Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes I don't expect ArchiveBox to support identifying the next thread page and archiving it, and being able to do this for all forum software. Instead I'd like if I could manually supply all the thread page URLs I'm interested in, and for ArchiveBox to be able to merge these webpages into a single snapshot. If you're interested in doing a full thread download I'd be very happy with that too but I also don't want it to be a maintenance burden. The merge-all-webpages-into-a-single-snapshot functionality is sufficient for now. ## What hacks or alternative solutions have you tried to solve the problem? Archiving each webpage of a thread separately. ## How badly do you want this new feature? - [ ] It's an urgent deal-breaker, I can't live without it - [x] It's important to add it in the near-mid term future - [ ] It would be nice to have eventually
kerem 2026-03-01 14:44:04 +03:00
Author
Owner

@pirate commented on GitHub (Jun 14, 2021):

Probably not going to implement this as it would break a lot of the core assumptions in the data layer model. I'd recommend using tags to group related pages instead.

I the future we may include user scrips that can unroll many forum threads on a single page before archiving, which should also help solve this need.

<!-- gh-comment-id:861052492 --> @pirate commented on GitHub (Jun 14, 2021): Probably not going to implement this as it would break a lot of the core assumptions in the data layer model. I'd recommend using tags to group related pages instead. I the future we may include user scrips that can unroll many forum threads on a single page before archiving, which should also help solve this need.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#487
No description provided.