mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 09:06:02 +03:00
[GH-ISSUE #630] Question: Obey uBlock filter rules? #391
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#391
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @winteriscariot on GitHub (Jan 22, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/630
Is it possible for any archive methods to obey uBlock filter rules added via the 'picker' utility in uBlock (you can find the full list of them in the 'My Filters' tab in uBlock settings). This would be for uBlock installed in the chrome profile identified in the config.
My other option would be to process the resulting archived files separately to remove the indicated elements, but if archivebox is using chrome to capture the website anyway figure it might be worth not pulling those elements at all (if possible)
thanks!
@winteriscariot commented on GitHub (Jan 22, 2021):
Issue #211 mentions being able to use ublock:
however it doesn't appear to be obeying user filters
@jacobwhall commented on GitHub (Feb 21, 2021):
I would also like to get this set up. According to #211 that you linked, this can be achieved by using the CHROME_USER_DATA_DIR option. You might also find #516 helpful. Good luck!
@pirate commented on GitHub (Feb 25, 2021):
This is a valid question, but I'm actually going to close this in favor of keeping the discussion here: https://github.com/ArchiveBox/ArchiveBox/issues/211, because I already had uBock Origin + Ghostery in mind for the implementation of that ticket.
Right now, using
CHROME_USER_DATA_DIRwith a profile that has the extension works for some people, but is buggy/impossible on other OS/browser/docker combos.I have an eventual more elegant solution in mind (involving a config option for chrome extensions + a profile to run) but it is blocked by:
playwrightextractor dependency in favor of the old chrome-headless CLI / pupppeteer approach (2-4 months)If you hit "subscribe" over on https://github.com/ArchiveBox/ArchiveBox/issues/211 I'll notify you when we get close to it.
If you can think of any quick-and-dirty solutions to make this easier that would get us there before the
playwrightimplementation, I'm all ears, please suggest them over on #211!