mirror of
https://github.com/ciur/papermerge.git
synced 2026-04-25 03:55:58 +03:00
[GH-ISSUE #473] Alternative Search Engine / Manticore #366
Labels
No labels
2.1
3.0
3.0.1
3.0.2
3.0.3
3.0.3
3.1
3.2
3.2
3.3
3.5
3.x
Fixed. Waiting for feedback.
Fixed. Waiting for feedback.
UX
Version 2.1 - alpha
XSS
announcement
beta
blocker
bug
cannot reproduce
confirmed
confirmed
critical
demo
dependencies
deployment
detchnical debt
discussion
docker
documentation
donations
duplicate
enhancement
feature request
frontend
fundraising
good first issue
good issue
help wanted
high
implemented
important
improvement
incomplete
invalid
investigation
kubernetes
low
low impact
medium
medium
medium impact
migration from 2.0
migration from 2.1
missing-language
missing-ocr-language
no-activity
note
ocr
outofscope
packaging
performance
popular request
pull-request
pypi
question
raspberry pi
roadmap
search
security
setup
status
task
technical debt
updates
user xp
version 1.4.0 - demo
will be implemented
will not be implemented
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/papermerge#366
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @ciur on GitHub (Aug 30, 2022).
Original GitHub issue: https://github.com/ciur/papermerge/issues/473
Originally assigned to: @ciur on GitHub.
Currently Papermerge DMS supports only one search engine backend - elasticsearch.
It would be great to support multiple search engines.
Consider manticoresearch.
Consider Apache Solr.
Consider Typesense.
Consider PostgreSQL FTS
Any other option ?
@telsch commented on GitHub (Sep 2, 2022):
The removed legacy database search engine for low resource hardware like raspberry pi?
@ciur commented on GitHub (Sep 2, 2022):
That's actually a good point. I really enjoyed PostgreSQL full text search :).
The disadvantage of that approach is that it is specifically bound one database type - PostgreSQL, but on the other hand
it is easier to use as there is no need for an extra service.
PostgreSQL FTS will be added back.
@ciur commented on GitHub (Sep 10, 2022):
These week I experimented with Django Haystack. I was pleasantly surprised - although package looks out of date - in reality it is very well written and works well.
When used, it adds support to four search backends: elasticsearch, solr, xapian and whoosh.
Solr and elasticsearch are well known, mature but very resource intensive (both have a minimum memory req > 4 GB RAM).
Whoosh it python only search engine. First impression - it works rather well for small set of data.
I was completely amazed by Xapian. I never heard of it (until couple of days ago) - but it works surprisingly well, is well documented and has great python bindings.
And best of all - are you ready? - it is written in C++ and has very small memory footprint.
Long story short - Xapian search is now included by default in Papermerge DMS. No configuration required.
Basically - Papermerge DMS, without any extra config comes with Xapian search engine.
In case are very adventurous and have lots of resources available - you can use Elasticsearch or Solr.
Somewhere in future I will include support Manticore - but for now, it is ok the way it is.