[GH-ISSUE #612] Additional languages (rus, ukr) cannot be selected in the OCR language list #483

Closed
opened 2026-02-25 21:32:01 +03:00 by kerem · 3 comments
Owner

Originally created by @deimjons on GitHub (Apr 20, 2024).
Original GitHub issue: https://github.com/ciur/papermerge/issues/612

Originally assigned to: @ciur on GitHub.

Description
Hello.
I am using a custom Docker image with the Russian and Ukrainian language packages for Tesseract installed, following the instructions at https://docs.papermerge.io/3.0/setup/add-ocr-langs/
Dockerfile:

FROM papermerge/papermerge:3.1

# add Ukrainian and Russian OCR languages
RUN apt install tesseract-ocr-rus tesseract-ocr-ukr

Info:

  • Papermerge Version [e.g. 3.1]
    The languages have been added in the container:
# docker exec -it papermerge bash
# tesseract --list-langs
List of available languages in "/usr/share/tesseract-ocr/5/tessdata/" (11):
deu
eng
fra
ita
nld
osd
por
ron
rus
spa
ukr

I see these languages in the OCR languages list, but I cannot select them and click Run OCR.

Screenshot 2024-04-19 at 18 45 39: https://github.com/ciur/papermerge/assets/10740038/b055abff-f5fa-4332-93c3-dbbad86c636b

@ciur commented on GitHub (Apr 21, 2024):

Yes, because the "rus" and "ukr" language codes are missing in the following places:

  • https://github.com/papermerge/papermerge-core/blob/master/ui/src/types.ts#L269
  • https://github.com/papermerge/papermerge-core/blob/master/ui/src/cconstants.ts#L10

I would gladly accept your pull request.
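
As a rough illustration of the kind of change involved: if a frontend restricts OCR languages to a fixed set of codes, a language that Tesseract reports as installed can still be rejected until its code is added on the UI side. The sketch below shows that pattern under stated assumptions. The names `OcrLangCode`, `OCR_LANG_LABELS`, and `isSupportedOcrLang` are hypothetical and are not taken from papermerge-core; the actual definitions in the files linked above may look quite different.

```typescript
// Hypothetical sketch, not the papermerge-core source.
// Union of Tesseract language codes the UI is willing to accept.
// "osd" is left out on purpose: it is Tesseract's orientation and
// script detection data, not an OCR language.
type OcrLangCode =
  | "deu" | "eng" | "fra" | "ita" | "nld" | "por" | "ron" | "spa"
  | "rus" | "ukr"; // the two codes this issue asks for

// Code-to-label map that a language dropdown could be rendered from.
// Record<OcrLangCode, string> makes the compiler complain if a code
// is added to the union but not given a label here.
const OCR_LANG_LABELS: Record<OcrLangCode, string> = {
  deu: "German",
  eng: "English",
  fra: "French",
  ita: "Italian",
  nld: "Dutch",
  por: "Portuguese",
  ron: "Romanian",
  spa: "Spanish",
  rus: "Russian",
  ukr: "Ukrainian",
};

// Type guard: true only for codes that are keys of the map above.
// hasOwnProperty is used instead of `in` so inherited names such as
// "toString" are not mistaken for language codes.
function isSupportedOcrLang(code: string): code is OcrLangCode {
  return Object.prototype.hasOwnProperty.call(OCR_LANG_LABELS, code);
}
```

With a layout like this, adding a language means touching both the type and the label map, which matches the two places pointed to in the comment above.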

@bl1nkker commented on GitHub (Apr 2, 2025):

@deimjons This issue is related to https://github.com/ciur/papermerge/issues/624, where I have provided detailed steps on how to add new OCR languages to Papermerge


@ciur commented on GitHub (May 8, 2025):

The docs were updated with steps on how to add additional languages: https://docs.papermerge.io/latest/setup/add-ocr-langs/
