mirror of
https://github.com/ciur/papermerge.git
synced 2026-04-25 12:05:58 +03:00
[GH-ISSUE #149] Search for non-latin text is case sensitive. #117
Labels
No labels
2.1
3.0
3.0.1
3.0.2
3.0.3
3.0.3
3.1
3.2
3.2
3.3
3.5
3.x
Fixed. Waiting for feedback.
Fixed. Waiting for feedback.
UX
Version 2.1 - alpha
XSS
announcement
beta
blocker
bug
cannot reproduce
confirmed
confirmed
critical
demo
dependencies
deployment
detchnical debt
discussion
docker
documentation
donations
duplicate
enhancement
feature request
frontend
fundraising
good first issue
good issue
help wanted
high
implemented
important
improvement
incomplete
invalid
investigation
kubernetes
low
low impact
medium
medium
medium impact
migration from 2.0
migration from 2.1
missing-language
missing-ocr-language
no-activity
note
ocr
outofscope
packaging
performance
popular request
pull-request
pypi
question
raspberry pi
roadmap
search
security
setup
status
task
technical debt
updates
user xp
version 1.4.0 - demo
will be implemented
will not be implemented
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/papermerge#117
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @ciur on GitHub (Oct 3, 2020).
Original GitHub issue: https://github.com/ciur/papermerge/issues/149
Originally assigned to: @ciur on GitHub.
Problem was reported for bulgarian language and 1.4.2 version. I tested it on Russian language and master branch.
Марк Аврелий.pdf
Expected
Uploaded document is expected to be present in search results as it contains words "Марк Аврелий".
All languages searches (cyrillic and latin) must not be case sensitive by default i.e.
if you search for "Аврелий" or "авреЛИй" (by default) results must be same.
Actual
Only if user searches for matching case- i.e. "Марк Аврелий" - uploaded document is revealed.
Desktop:
@ciur commented on GitHub (Oct 4, 2020):
I think the "problem" is in database. There is a known issue with SQLite database. In default setup it performs case sensitive setups for Unicode strings.
Still need to confirm that for PostgreSQL behaviour is correct.
@ciur commented on GitHub (Oct 4, 2020):
Confirmed. Application works as expected with PostgreSQL database. Thus, the problem is because of SQLite database.
I won't fix the issue, as SQLite is not meant to run in production environments.