mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 17:16:00 +03:00
[GH-ISSUE #673] How to search for site URL? #1931
Originally created by @voarsh2 on GitHub (Mar 24, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/673
I have various sites, and URLs for different parts of a site, indexed; however, searching for URLs doesn't seem to work at all.
The only thing that works when searching through the UI is the website title, which isn't good enough for me.
Any workarounds?
@pirate commented on GitHub (Mar 24, 2021):
URL search works but it has to be exact; this is a known issue fixed in v0.6 (coming soon). If you want, you can try running archivebox from the pre-release v0.6 debug-toolbar branch to get this fix.
@saywebsolutions commented on GitHub (Mar 24, 2021):
you can also use filesystem tools like grep, fzf, etc. to search the archive html and text files directly which will get you full text search and a lot of flexibility
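As a rough illustration of the filesystem approach suggested above, here is a minimal Python sketch that behaves like a recursive, case-insensitive grep over text-like files. The function name `search_archive`, the list of file suffixes, and the `./archive` path in the usage note are assumptions for illustration only; they are not part of ArchiveBox itself, and a plain `grep -ri` over the same folder would do the same job.

```python
from pathlib import Path


def search_archive(root, query, suffixes=(".html", ".htm", ".txt", ".json")):
    """Yield (path, line_number, line) for every line containing `query`.

    Hypothetical helper: matching is a case-insensitive substring test,
    so a partial URL such as a base domain is enough to get a hit.
    """
    needle = query.lower()
    for path in sorted(Path(root).rglob("*")):
        # Only look at regular files with a text-like suffix.
        if not path.is_file() or path.suffix.lower() not in suffixes:
            continue
        try:
            # errors="replace" keeps the scan going on badly encoded pages.
            with path.open("r", encoding="utf-8", errors="replace") as fh:
                for line_number, line in enumerate(fh, start=1):
                    if needle in line.lower():
                        yield path, line_number, line.strip()
        except OSError:
            # Skip files that cannot be read instead of aborting the search.
            continue
```

Calling `list(search_archive("./archive", "example.com"))` would return every matching line together with the file it came from, assuming the snapshots live under `./archive`. This only finds snapshots; it does not help with bulk tagging or deleting from the UI.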
@voarsh2 commented on GitHub (Mar 25, 2021):
I eagerly await this fix then... because I'm typing almost exact URLs (base domain) with no results....
Uh, yeah, but I'd rather use the UI... especially for bulk update, tag, delete, etc.
@voarsh2 commented on GitHub (Apr 2, 2021):
Just thought I would add that the [site]/public search performs much better than the native Django admin/site snapshot screens. How come?
I want to delete, rescan, etc., but I can't search effectively in the admin area to select 30+ records, whereas it's no problem on the public version.
@pirate commented on GitHub (Apr 2, 2021):
We implemented the public search ourselves, but the admin search is django's built-in search function that's really difficult to customize. Still working out the best way to override it to use our algo instead.
github.com/ArchiveBox/ArchiveBox@7162649b03/archivebox/core/admin.py (L58)
github.com/ArchiveBox/ArchiveBox@7162649b03/archivebox/core/mixins.py (L5)
@voarsh2 commented on GitHub (Apr 4, 2021):
I figured that. At least make /public/ have the ability to remove content? That would get around that issue.
Because I can't manage content at /public/ and it's the only place I can actually search properly.
@pirate commented on GitHub (Apr 5, 2021):
Can't do that; public is read-only. What you can do is search on public to find the timestamp, then search using that timestamp on private until v0.6 is released.
@pirate commented on GitHub (Apr 6, 2021):
Ok, I just pushed a real fix, cadac48, instead of the hack I had before. This is officially fixed in v0.6 (which is on dev/master). I'll be pushing the Docker image, pip package, apt package, etc. release versions soon.
I'm going to close this as fixed for now to clean up the issue backlog. Feel free to comment back here if you're still having issues after the new release drops and I'll reopen the issue.