[PR #570] [MERGED] Full-text search (rebased) #4239

Closed
opened 2026-03-15 01:33:55 +03:00 by kerem · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/570
Author: @pirate
Created: 12/5/2020
Status: Merged
Merged: 12/5/2020
Merged by: @pirate

Base: v0.5.0Head: sonic-search


📝 Commits (10+)

  • b1f70b2 Initial implementation
  • 5f6673c Implement backend architecture for search engines
  • c2c01af Add config for search backend
  • 47daa03 Implement flush for search backend after remove command
  • f383648 Use a generator for snapshot flush from index
  • 823df34 Use QuerySets for search backend API instead of pks
  • fb67d66 fix: Return empty QuerySet instead of list
  • 0f7dba0 feat: add search filter-type to list command
  • 0773f12 Add sonic to docker-compose
  • a38e3e0 Get searc backend password from env var SEARCH_BACKEND_PASSWORD

📊 Changes

20 files changed (+406 additions, -12 deletions)

View changed files

📝 Dockerfile (+1 -1)
archivebox.egg-info (+0 -1)
📝 archivebox/cli/archivebox_list.py (+1 -1)
📝 archivebox/cli/archivebox_update.py (+1 -1)
📝 archivebox/config.py (+13 -1)
📝 archivebox/core/admin.py (+4 -1)
archivebox/core/mixins.py (+23 -0)
📝 archivebox/core/models.py (+14 -3)
📝 archivebox/extractors/__init__.py (+5 -0)
📝 archivebox/extractors/readability.py (+5 -2)
📝 archivebox/index/__init__.py (+28 -1)
📝 archivebox/index/schema.py (+1 -0)
📝 archivebox/main.py (+3 -0)
archivebox/search/__init__.py (+110 -0)
archivebox/search/backends/__init__.py (+0 -0)
archivebox/search/backends/ripgrep.py (+47 -0)
archivebox/search/backends/sonic.py (+28 -0)
archivebox/search/utils.py (+44 -0)
📝 docker-compose.yml (+12 -0)
etc/sonic/config.cfg (+66 -0)

📄 Description

This is just a copy of #543 with the conflicts fixed.

image


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ArchiveBox/ArchiveBox/pull/570 **Author:** [@pirate](https://github.com/pirate) **Created:** 12/5/2020 **Status:** ✅ Merged **Merged:** 12/5/2020 **Merged by:** [@pirate](https://github.com/pirate) **Base:** `v0.5.0` ← **Head:** `sonic-search` --- ### 📝 Commits (10+) - [`b1f70b2`](https://github.com/ArchiveBox/ArchiveBox/commit/b1f70b219736378170c1dcda1131792bf83c1830) Initial implementation - [`5f6673c`](https://github.com/ArchiveBox/ArchiveBox/commit/5f6673c72c472ce23f192e7661ec449134fbf463) Implement backend architecture for search engines - [`c2c01af`](https://github.com/ArchiveBox/ArchiveBox/commit/c2c01af3adfd69c1984b5c6b2cdc1aa59b08c32b) Add config for search backend - [`47daa03`](https://github.com/ArchiveBox/ArchiveBox/commit/47daa038eb61674df22345e99201472ea770762c) Implement flush for search backend after remove command - [`f383648`](https://github.com/ArchiveBox/ArchiveBox/commit/f383648ffc80e64bfa399efc5e1b7766fe7de3dd) Use a generator for snapshot flush from index - [`823df34`](https://github.com/ArchiveBox/ArchiveBox/commit/823df34080a0ac8aa9cc6d4e9d689a3d4cf84309) Use QuerySets for search backend API instead of pks - [`fb67d66`](https://github.com/ArchiveBox/ArchiveBox/commit/fb67d6684c4ba229450767ab8afef2a7b158cd99) fix: Return empty QuerySet instead of list - [`0f7dba0`](https://github.com/ArchiveBox/ArchiveBox/commit/0f7dba07dfe673d5915c1bfb344a24b4cb027e84) feat: add search filter-type to list command - [`0773f12`](https://github.com/ArchiveBox/ArchiveBox/commit/0773f12034239304aea3dbccf61edcf0392201f4) Add sonic to docker-compose - [`a38e3e0`](https://github.com/ArchiveBox/ArchiveBox/commit/a38e3e0c90ad8954dfe151e83c68af9c04cf4f42) Get searc backend password from env var SEARCH_BACKEND_PASSWORD ### 📊 Changes **20 files changed** (+406 additions, -12 deletions) <details> <summary>View changed files</summary> 📝 `Dockerfile` (+1 -1) ➖ `archivebox.egg-info` (+0 -1) 📝 `archivebox/cli/archivebox_list.py` (+1 -1) 📝 `archivebox/cli/archivebox_update.py` (+1 -1) 📝 `archivebox/config.py` (+13 -1) 📝 `archivebox/core/admin.py` (+4 -1) ➕ `archivebox/core/mixins.py` (+23 -0) 📝 `archivebox/core/models.py` (+14 -3) 📝 `archivebox/extractors/__init__.py` (+5 -0) 📝 `archivebox/extractors/readability.py` (+5 -2) 📝 `archivebox/index/__init__.py` (+28 -1) 📝 `archivebox/index/schema.py` (+1 -0) 📝 `archivebox/main.py` (+3 -0) ➕ `archivebox/search/__init__.py` (+110 -0) ➕ `archivebox/search/backends/__init__.py` (+0 -0) ➕ `archivebox/search/backends/ripgrep.py` (+47 -0) ➕ `archivebox/search/backends/sonic.py` (+28 -0) ➕ `archivebox/search/utils.py` (+44 -0) 📝 `docker-compose.yml` (+12 -0) ➕ `etc/sonic/config.cfg` (+66 -0) </details> ### 📄 Description This is just a copy of #543 with the conflicts fixed. ![image](https://user-images.githubusercontent.com/511499/101267520-95b74400-3727-11eb-91a4-b4a0cebad03e.png) --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
kerem 2026-03-15 01:33:55 +03:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#4239
No description provided.