mirror of
https://github.com/ciur/papermerge.git
synced 2026-04-25 03:55:58 +03:00
[GH-ISSUE #603] Feature Request: OCR support for digitally signed dcouments. #477
Labels
No labels
2.1
3.0
3.0.1
3.0.2
3.0.3
3.0.3
3.1
3.2
3.2
3.3
3.5
3.x
Fixed. Waiting for feedback.
Fixed. Waiting for feedback.
UX
Version 2.1 - alpha
XSS
announcement
beta
blocker
bug
cannot reproduce
confirmed
confirmed
critical
demo
dependencies
deployment
detchnical debt
discussion
docker
documentation
donations
duplicate
enhancement
feature request
frontend
fundraising
good first issue
good issue
help wanted
high
implemented
important
improvement
incomplete
invalid
investigation
kubernetes
low
low impact
medium
medium
medium impact
migration from 2.0
migration from 2.1
missing-language
missing-ocr-language
no-activity
note
ocr
outofscope
packaging
performance
popular request
pull-request
pypi
question
raspberry pi
roadmap
search
security
setup
status
task
technical debt
updates
user xp
version 1.4.0 - demo
will be implemented
will not be implemented
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/papermerge#477
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @ShakataGaNai on GitHub (Mar 1, 2024).
Original GitHub issue: https://github.com/ciur/papermerge/issues/603
Originally assigned to: @ciur on GitHub.
Running v3.1 out of docker containers for testing (per https://docs.papermerge.io/3.1/setup/docker-compose/ ). When you upload and attempt to OCR a digitally signed document, the process fails silently. Looking at the logs (from the worker) finds a logical error message:
I can't find any mention of this anywhere, but supporting OCR for digitally signed documents would be nice. Perhaps the version dropdown can indicate something like "Version X w/ OCRed and w/o digital signature". Honestly, I don't even care about accessing a version of the document with OCR'd text, so long as the text is there for full text search. Especially when dealing with a multiplicity of signed legal documents.
@ciur commented on GitHub (Mar 3, 2024):
Thank you for opening this ticket.
Would you mind uploading a digitally signed document that I can experiment with? Of course, I mean document without sensitive information. One page document (digitally signed) with a couple of words would do the job just fine.
This will help me understand your request better and, of course, validate the feature while developing it.
@ShakataGaNai commented on GitHub (Mar 3, 2024):
Attaching 3. One is a digital document pushed right through docusign. One is the same document printed then scanned, and through docusign. The third is the same print/scan document signed with Adobe Acrobat (which I'm least confident in working, because Adobe...)
Lipsum scan - adobe signed.pdf
Lipsum scan - docusign.pdf
lipsum - docusign.pdf
@bluekitedreamer commented on GitHub (Apr 23, 2024):
Possibly a simple issue to fix, see another issue recently filed here with solution suggestion (https://github.com/ciur/papermerge/issues/614#issue-2255198217).