mirror of
https://github.com/RD17/ambar.git
synced 2026-04-25 23:45:50 +03:00
[GH-ISSUE #194] Indexing multiple languages in one dataset #192
Labels
No labels
$$ Paid Support
bug
bug
enhancement
help wanted
invalid
pull-request
question
question
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ambar#192
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @Loo0D on GitHub (Oct 21, 2018).
Original GitHub issue: https://github.com/RD17/ambar/issues/194
Hello!
We are trialling Ambar as a lightweight e-discovery product.
The documentation states:
Our sample dataset was configured with
ambar_en. The dataset contains English and Greek documents, and the search finds both, which is great. However, it would be good to understand what exactlylangAnalyzerflag does, i.e. does it only apply to tesseract/OCR?In other words, what are we missing on the Greek side (in this case) by setting the analyser to English?
Thanks!
@stale[bot] commented on GitHub (Nov 5, 2018):
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.