mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 09:06:02 +03:00
[GH-ISSUE #786] Feature Request: Allow locally run ArchiveBox CLI commands to control a separate remote ArchiveBox backend #2009
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#2009
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @huajianmao on GitHub (Jul 8, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/786
Type
What is the problem that your feature request solves
I may use ArchiveBox in multiple machines and want to centrally store the archives in one server.
Describe the ideal specific solution you'd want, and whether it fits into any broader scope of changes
What hacks or alternative solutions have you tried to solve the problem?
How badly do you want this new feature?
@pirate commented on GitHub (Jul 8, 2021):
We might get this ability for free with the upcoming message passing architecture refactor https://github.com/ArchiveBox/ArchiveBox/issues/91#issuecomment-871343428, but it may take a while as that refactor is a big one.
For now I suggest using SSH/rsync to push your files to a dir in the central server e.g.
rsync /some/local/urls.txt central-server:/path/to/urls.txt,and then on the central server run:
archivebox schedule --every=day --depth=1 /path/to/urls.txt.Another way to submit URLs to a remote ArchiveBox instance is to make a POST request to the
/core/snapshot/add/endpoint with your URLs, see here for more info https://github.com/ArchiveBox/ArchiveBox/issues/577#issuecomment-872961973 and here for an example of how to make a request to the endpoint:github.com/tjhorner/archivebox-exporter@adef67d6b9/src/common/services/archiver.ts (L78).@huajianmao commented on GitHub (Jul 8, 2021):
I see.
ArchiveBox is awesome, and thanks for your suggestion! @pirate
@huajianmao commented on GitHub (Jul 8, 2021):
About the architecture refactor, how about refactor ArchiveBox to a client / server model?
Or keep the current architecture, and separate the url management operations to a thin ArchiveBox client which can be configured to use a remote ArchiveBox server to save the archives? @pirate
@huajianmao commented on GitHub (Jul 8, 2021):
Besides, it would be really great if a browser plugin could be provided.🤤
@pirate commented on GitHub (Jul 13, 2021):
This is already how the message-passing refactor works, it will be like client-server on steroids. User actions will cause tasks to be emitted to one of several task queues, which can then be processed by workers on the same machine or remotely. Any subcomponent of ArchiveBox will be configurable to run locally or remotely, so you can split up the CLI, the web backend, the parser workers, and the archiver workers onto any machines you want.
A user-contributed browser plugin is already available 😉, see here: https://github.com/ArchiveBox/ArchiveBox/issues/577#issuecomment-872915877
You'll have to run the pre-release version of ArchiveBox on
devto use it, read the full thread for details ^.@huajianmao commented on GitHub (Jul 14, 2021):
Great and thanks!
@jfinkhaeuser commented on GitHub (Apr 17, 2023):
What's the status here? I don't see so much progress, but this is pretty much the number one thing I'm missing.
@pirate commented on GitHub (Apr 19, 2023):
I would say don't expect this soon, as the temporary solution to use SSH works in almost all cases and it's a lot of complexity for me to maintain.
Higher priority for me right now is the REST API: https://github.com/ArchiveBox/ArchiveBox/issues/496
@jfinkhaeuser commented on GitHub (Apr 19, 2023):
Fair enough, thank you!
Yes, the workaround with SSH works. It's a tad more complex than I would like, but the important part is that there is something that can be done!