mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 17:16:00 +03:00
[GH-ISSUE #1590] Feature Request: JSON-based search API for SearxNG #3963
Originally created by @jrruethe on GitHub (Nov 11, 2024).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1590
Originally assigned to: @pirate on GitHub.
What type of suggestion are you making?
Proposing a new feature
What is the problem that your feature request solves?
I love ArchiveBox, and have almost 20k snapshots. The Sonic search method works great! I would love to be able to hook ArchiveBox into SearxNG as a search provider. The easiest way would be an API that runs an ArchiveBox search and returns the results in JSON format. From what I can tell, the search results are currently rendered server-side and returned as HTML.
What is your proposed solution?
Some sort of /api/search?query="blah" endpoint that returns search results in JSON format.
What hacks or alternative solutions have you tried to solve the problem?
Currently, I have Python code that scrapes the ArchiveBox HTML results and extracts the titles/URLs using CSS selectors.
What version of ArchiveBox are you currently using?
How badly do you want this new feature?
Mini Survey
@pirate commented on GitHub (Nov 12, 2024):
The new REST API in >= v0.8.5 has an /api/v1/list endpoint which does exactly what you're looking for: https://demo.archivebox.io/api/v1/docs#/ArchiveBox%20CLI%20Sub-Commands/api_v1_cli_cli_list
Just pass ?as_json=true&filter_type=search&... and it'll search using the full-text index and return JSON.
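For anyone wiring this into a SearxNG engine, here is a minimal Python sketch of what such a request might look like. The endpoint path and the as_json / filter_type parameters are taken from the comment above. Everything else is an assumption to check against your own instance's /api/v1/docs page: the filter_patterns parameter name for the search terms, the helper names build_search_url and search, the base URL, and whatever authentication headers your server requires. The shape of the returned JSON is not specified in this thread, so the sketch returns it unparsed beyond json decoding.

```python
import json
import urllib.parse
import urllib.request

# Endpoint path as quoted in the comment above; verify it in your API docs.
LIST_ENDPOINT = "/api/v1/list"


def build_search_url(base_url, query):
    """Build the full-text search URL for an ArchiveBox instance."""
    params = {
        "as_json": "true",          # from the comment above
        "filter_type": "search",    # from the comment above
        "filter_patterns": query,   # ASSUMPTION: parameter name for the search terms
    }
    return base_url.rstrip("/") + LIST_ENDPOINT + "?" + urllib.parse.urlencode(params)


def search(base_url, query, headers=None, timeout=10):
    """Send the search request and return the decoded JSON response.

    `headers` is where any auth token would go (ASSUMPTION: your instance
    may or may not require one; the thread does not say).
    """
    request = urllib.request.Request(
        build_search_url(base_url, query),
        headers=headers or {},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Usage would be something like search("http://localhost:8000", "some query"), with the result then mapped onto whatever title/URL fields your SearxNG engine expects.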