[GH-ISSUE #130] Feature request: Handle bulk scans separated by QR/bar codes #99

Open
opened 2026-02-25 21:31:12 +03:00 by kerem · 5 comments
Owner

Originally created by @patrk on GitHub (Sep 19, 2020).
Original GitHub issue: https://github.com/ciur/papermerge/issues/130

Hello, I am currently giving this project a try to finally get my personal paper stack under control. Since I have a huge backlog of accumulated documents and correspondence, I would like to use bulk scans with separator pages.

It would be nice if the importer could handle those bulk scans and split the documents accordingly.

I might also contribute to the project once I have made some progress importing the most important files into my Papermerge instance :)

@ciur commented on GitHub (Sep 19, 2020):

Hi @patrk,
thank you for considering Papermerge.

Can you please be more specific about the role QR/bar codes play in separating pages?
A more detailed description of your use case/context will help me understand your request.

At this point you can bulk scan documents and later move pages around from one document to another (cut/paste). This is exactly how I use Papermerge (most of the time I bulk/batch scan).
The page management feature is described in the [documentation here](https://papermerge.readthedocs.io/en/latest/page_management.html).
There is a screencast [demo](https://www.youtube.com/watch?v=CRhUpPqCI64) of this feature as well.
You can also see a [batch scan example in this screencast demo](https://www.youtube.com/watch?v=OpwTaEN5t2Y).

@patrk commented on GitHub (Sep 19, 2020):

Hello,

thanks for the quick response. I am glad that Papermerge already offers the functionality to edit and split documents by pages.

However, I have hundreds of letters and documents to scan and my batch sizes are around 50-100 pages per scan.

Currently, I put a printed separator page carrying a QR code in front of each document. I have a Python script that post-processes each batch scan by dropping the separator pages and starting a new PDF file at each one. Doing this splitting manually is tedious and time-consuming.

Having that integrated in Papermerge would therefore save me the time I spend on this processing. Perhaps you could even offer some processing API through which users simply call their own scripts.
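
The separator-page procedure described above can be sketched roughly as follows. This is not the actual script mentioned in this comment, only a minimal illustration of the splitting step: `split_at_separators` and `is_separator` are hypothetical names, and pages are stood in for by plain strings. In a real pipeline the predicate would decode the QR code on a rendered page image instead.

```python
from typing import Callable, Iterable, List, TypeVar

Page = TypeVar("Page")


def split_at_separators(
    pages: Iterable[Page],
    is_separator: Callable[[Page], bool],
) -> List[List[Page]]:
    """Group a batch scan into documents, dropping the separator pages.

    `is_separator` is a caller-supplied predicate (hypothetical here);
    it decides whether a page is a separator sheet.
    """
    documents: List[List[Page]] = []
    current: List[Page] = []
    for page in pages:
        if is_separator(page):
            # A separator closes the running document (if it has pages)
            # and is itself discarded.
            if current:
                documents.append(current)
            current = []
        else:
            current.append(page)
    # Flush the last document, which has no trailing separator.
    if current:
        documents.append(current)
    return documents


if __name__ == "__main__":
    # Strings stand in for scanned pages; "SEP" marks a separator sheet.
    batch = ["SEP", "letter-p1", "letter-p2", "SEP", "invoice-p1",
             "SEP", "SEP", "contract-p1"]
    for doc in split_at_separators(batch, lambda p: p == "SEP"):
        print(doc)
```

Consecutive separators and a leading separator produce no empty documents, which keeps a double-fed separator sheet from creating blank files.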

@jpguyon52 commented on GitHub (Sep 28, 2020):

I have exactly the same requirements as patrk.
I need to import more than 500 different documents (7 years of record keeping).
A separator page with automation would let me feed those documents through a fast scanner and have them split automatically whenever a separator is seen.

@patrk commented on GitHub (Sep 29, 2020):

@jpguyon52 Glad to hear that someone is having the same struggle in the transition to a paperless document archive. I might make a pull request in case the contributors are not prioritizing this feature.

@ciur commented on GitHub (Sep 29, 2020):

Ah, now I understand the whole picture 👍 and I fully agree that this feature makes perfect sense.

The Paperless project has a [scripts feature](https://paperless.readthedocs.io/en/latest/consumption.html?highlight=scripts#hooking-into-the-consumption-process), though, which allows you to write scripts that are executed at various stages of the consumption process. Papermerge, on the other hand, does not have that.
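
To make the idea of a consumption hook concrete, here is a minimal sketch of how an importer could hand a document to a user-supplied script. It is not the Paperless or Papermerge API: `run_hook` and its parameters are hypothetical, and the only assumption is that the script receives the document path as its last argument and signals success with exit code 0.

```python
import subprocess
import sys
from typing import Sequence


def run_hook(command: Sequence[str], document_path: str) -> int:
    """Run a user-supplied command with the document path appended.

    Hypothetical helper: returns the command's exit code so the
    importer can decide whether to continue with the document.
    """
    completed = subprocess.run([*command, document_path], check=False)
    return completed.returncode


if __name__ == "__main__":
    # Stand-in for a real user script: a tiny inline Python program
    # that prints the path it was given and exits successfully.
    demo_command = [sys.executable, "-c",
                    "import sys; print('processing', sys.argv[1])"]
    exit_code = run_hook(demo_command, "batch-scan.pdf")
    print("hook exit code:", exit_code)
```

A splitting script like the one discussed in this thread could then be plugged in as the command, without the importer needing to know anything about QR codes.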