mirror of
https://github.com/ciur/papermerge.git
synced 2026-04-25 12:05:58 +03:00
[GH-ISSUE #598] Exclude document from OCR #471
Labels
No labels
2.1
3.0
3.0.1
3.0.2
3.0.3
3.0.3
3.1
3.2
3.2
3.3
3.5
3.x
Fixed. Waiting for feedback.
Fixed. Waiting for feedback.
UX
Version 2.1 - alpha
XSS
announcement
beta
blocker
bug
cannot reproduce
confirmed
confirmed
critical
demo
dependencies
deployment
detchnical debt
discussion
docker
documentation
donations
duplicate
enhancement
feature request
frontend
fundraising
good first issue
good issue
help wanted
high
implemented
important
improvement
incomplete
invalid
investigation
kubernetes
low
low impact
medium
medium
medium impact
migration from 2.0
migration from 2.1
missing-language
missing-ocr-language
no-activity
note
ocr
outofscope
packaging
performance
popular request
pull-request
pypi
question
raspberry pi
roadmap
search
security
setup
status
task
technical debt
updates
user xp
version 1.4.0 - demo
will be implemented
will not be implemented
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/papermerge#471
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @thndrbck on GitHub (Feb 16, 2024).
Original GitHub issue: https://github.com/ciur/papermerge/issues/598
Originally assigned to: @ciur on GitHub.
Forms filled in by hand don't need Optical Character Recognition. The OCR database would fill up with form field labels. Also, disk storage will fill up with unnecessary OCR duplicates.
If you could include a check box when uploading a file so that it is marked for no OCR, that would be helpful.
A toggle to turn off OCR when batch uploading documents would also be helpful.
@ciur commented on GitHub (Feb 17, 2024):
Thank you for opening this ticket!
This feature makes perfect sense and it is relatively easy to implement.
Will be implemented as part of next release 3.1, which will be out in couple of weeks.
@thndrbck commented on GitHub (Feb 19, 2024):
Re: Did you meant here exclude entire document from being OCRed - which is exactly as https://github.com/ciur/papermerge/issues/598 ?
Or did you really meant to exclude specific pages from being OCRed ?
In last case, i.e. when you mean to exclude specific pages from OCRed - it is not possible to implement. It is either entire document (i.e. all pages in the document) or nothing.
I meant not OCRing the entire document.
@ciur commented on GitHub (Feb 23, 2024):
Added
PR#332
Feature will be part of the 3.1.0 release.