[GH-ISSUE #1243] Question: PDF file size unnecessarily large #2273

Closed
opened 2026-03-01 17:57:50 +03:00 by kerem · 1 comment

Originally created by @allen7u on GitHub (Oct 11, 2023).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1243

The generated PDF files usually seem unnecessarily large and easily exceed 30 MB. Is there a way or a config option to compress them on the fly?

kerem closed this issue 2026-03-01 17:57:50 +03:00

@pirate commented on GitHub (Oct 11, 2023):

PDF size depends mostly on the size of the images in the page: the PDF reproduces them at full resolution, and no compression is applied by default, to avoid losing quality.

I would recommend using an external utility or filesystem-level compression to reduce size, e.g. ZFS with `-o compression=lz4`: https://linuxhint.com/enable-zfs-compression/

You can also run a compression utility like Ghostscript (see https://www.digitalocean.com/community/tutorials/reduce-pdf-file-size-in-linux) across all the PDFs in one go, like so:
`some-compression-tool ./archive/*/output.pdf`
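
As a rough illustration of the "one go" idea, here is a minimal Python sketch that runs Ghostscript over every `./archive/*/output.pdf`. It is not something ArchiveBox ships. The helper names `build_gs_command` and `compress_all`, the temporary file name `output.compressed.pdf`, and the choice of the `/ebook` preset are all assumptions made for the example. It also assumes the `gs` binary is on your `PATH`. Downsampling images is lossy, so try it on a copy of the archive first.

```python
import subprocess
from pathlib import Path


def build_gs_command(src, dst, preset="/ebook"):
    """Build a Ghostscript command line that rewrites src into dst.

    The preset (hypothetical default: /ebook) controls how aggressively
    images are downsampled; /screen is smaller, /printer is larger.
    """
    return [
        "gs",
        "-sDEVICE=pdfwrite",          # write a new PDF
        "-dCompatibilityLevel=1.4",
        f"-dPDFSETTINGS={preset}",    # quality/size preset
        "-dNOPAUSE",
        "-dQUIET",
        "-dBATCH",                    # exit when done instead of prompting
        f"-sOutputFile={dst}",
        str(src),
    ]


def compress_all(archive_root="./archive", preset="/ebook"):
    """Compress every <archive_root>/*/output.pdf in place.

    The original is only replaced when the rewritten file is smaller.
    """
    for src in sorted(Path(archive_root).glob("*/output.pdf")):
        tmp = src.with_name("output.compressed.pdf")  # assumed temp name
        subprocess.run(build_gs_command(src, tmp, preset), check=True)
        if tmp.stat().st_size < src.stat().st_size:
            tmp.replace(src)   # keep the smaller, rewritten PDF
        else:
            tmp.unlink()       # no gain, keep the original untouched
```

Calling `compress_all("./archive")` from the ArchiveBox data directory would process all snapshots. Comparing sizes before replacing means PDFs that Ghostscript cannot shrink are left as they were.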
