[GH-ISSUE #19] Feature Request: Chinese (Simplified and Traditional) and Japanese Language Analyzer #19

Closed
opened 2026-02-27 15:54:29 +03:00 by kerem · 1 comment
Owner

Originally created by @mspencer08 on GitHub (Apr 18, 2017).
Original GitHub issue: https://github.com/RD17/ambar/issues/19

Hi there,

I found this great app on reddit and am trying it out, both on your public cloud and on local dev machine using docker. Now I 'd like to ask if you can add the language analyzer for Chinese (Simplified and Traditional) and Japanese?

The CJK are some tricky languages to deal with and here's what I found on stackoverflow regarding the Chinese and Japanese language analyzers: http://stackoverflow.com/questions/29098347/elasticsearch-cjk-language-analyser. Sadly there isn't one for Korean yet.

I also want to curious about how well the current Tesseract OCR tuning included in ambar works with CJK languages.

Thanks.

Originally created by @mspencer08 on GitHub (Apr 18, 2017). Original GitHub issue: https://github.com/RD17/ambar/issues/19 Hi there, I found this great app on reddit and am trying it out, both on your public cloud and on local dev machine using docker. Now I 'd like to ask if you can add the language analyzer for Chinese (Simplified and Traditional) and Japanese? The CJK are some tricky languages to deal with and here's what I found on stackoverflow regarding the Chinese and Japanese language analyzers: http://stackoverflow.com/questions/29098347/elasticsearch-cjk-language-analyser. Sadly there isn't one for Korean yet. I also want to curious about how well the current Tesseract OCR tuning included in ambar works with CJK languages. Thanks.
kerem 2026-02-27 15:54:29 +03:00
Author
Owner

@sochix commented on GitHub (Apr 20, 2017):

Done

<!-- gh-comment-id:295889087 --> @sochix commented on GitHub (Apr 20, 2017): Done
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ambar#19
No description provided.