mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-26 01:26:00 +03:00
[GH-ISSUE #1689] Bug: Dependencies like Chrome are not detected in v0.7.3 Docker image despite being installed #4025
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#4025
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @Guanchishan on GitHub (Jul 19, 2025).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1689
Originally assigned to: @pirate on GitHub.
Provide a screenshot and describe the bug
After upgrading the Docker image from v0.6.2 to v0.7.3, I encountered and fixed several database integrity errors. However,
archivebox statusstill reports 4 missing dependencies (CHROME_BINARY, SINGLEFILE_BINARY, etc.), showing them as "unable to detect version". This happens even after manually installing these dependencies inside the container as the root user. The system seems to have the dependencies, but ArchiveBox cannot detect them.I have tried many steps to fix this, including:
Entering the container as
root.Running
apt-get updateand thenplaywright install-depsto install all system libraries. This step completed successfully.Running
playwright installto install browser binaries. This also completed successfully.Running
chown -R archivebox:archivebox /nodeto fix potential npm permission issues.Despite all these steps, when running
archivebox statusas thearchiveboxuser, the dependencies are still reported as missing.Steps to reproduce
Logs or errors
ArchiveBox Version
How did you install the version of ArchiveBox you are using?
Docker (or Podman/LXC/K8s/TrueNAS/Proxmox/etc)
What operating system are you running on?
Linux (Ubuntu/Debian/Arch/Alpine/etc.)
What type of drive are you using to store your ArchiveBox data?
data/is on a local SSD or NVMe drivedata/is on a spinning hard drive or external USB drivedata/is on a network mount (e.g. NFS/SMB/Ceph/GlusterFS/etc.)data/is on a FUSE mount (e.g. SSHFS/RClone/S3/B2/Google Drive/Dropbox/etc.)Docker Environment
APP_VENV/app/.venvAPT_KEY_DONT_WARN_ON_DANGEROUS_USAGE1ARCHIVEBOX_USERarchiveboxCHROME_BINARYchromiumCHROME_SANDBOXFalseCODE_DIR/appDATA_DIR/dataDEBIAN_FRONTENDnoninteractiveDEFAULT_PGID911DEFAULT_PUID911GLOBAL_VENV/venvGPG_KEYE3FF2839C048B25C084DEBE9B26995E310250568IN_DOCKERTrueLANGC.UTF-8LANGUAGEen_US:enLC_ALLC.UTF-8MERCURY_BINARY/node/node_modules/.bin/mercury-parserNODE_DIR/nodeNODE_MODULES/app/node_modulesNODE_VERSION20PATH/usr/local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/node/node_modules/.bin:/venv/binPIP_DISABLE_PIP_VERSION_CHECK1PLAYWRIGHT_BROWSERS_PATH/browsersPYTHONIOENCODINGUTF-8PYTHONUNBUFFERED1PYTHON_GET_PIP_SHA25696461deced5c2a487ddc65207ec5a9cffeca0d34e7af7ea1afc470ff0d746207PYTHON_GET_PIP_URLhttps://github.com/pypa/get-pip/raw/0d8570dc44796f4369b652222cf176b3db6ac70e/public/get-pip.pyPYTHON_PIP_VERSION23.0.1PYTHON_SETUPTOOLS_VERSION58.1.0PYTHON_SHA2562a9920c7a0cd236de33644ed980a13cbbc21058bfdc528febb6081575ed73be3PYTHON_VERSION3.9.17READABILITY_BINARY/node/node_modules/.bin/readability-extractorSINGLEFILE_BINARY/node/node_modules/.bin/single-fileTZUTCUSE_MERCURYTrueUSE_READABILITYTrueUSE_SINGLEFILETrueVENV_PATH/venvnpm_config_loglevelerrorArchiveBox Configuration
@pirate commented on GitHub (Jul 21, 2025):
hmm those binaries ship inside the docker container already, they should require no manual installation: https://github.com/ArchiveBox/ArchiveBox/blob/v0.7.3/Dockerfile#L210
You can confirm with:
docker run -it archivebox/archivebox:latest versionAre you sure you havent attempted to re-install them inside the container or otherwise changed the container's filesystem?
Also check your
ArchiveBox.confto make sure you dont have custom paths set for those binaries. We don't use/nodeanymore anywhere. You can delete anyNODE_DIRconfig you have or any value that starts with/nodefrom your config.@Guanchishan commented on GitHub (Jul 23, 2025):
Hi, thanks for the suggestion.
I've now corrected my
ArchiveBox.conffile. As you suspected, I had a customCHROME_BINARYpath set. I have removed that line from my configuration.After updating the config, I completely reinstalled the container to ensure a clean state.
However, I'm now facing a new issue. When I run
archivebox setup, I encounter a permission error. The error message specifically refers to thebrowserfolder. I have already checked the permissions on the host machine, and they appear to be correct. This leads me to believe the problem originates from within the container itself, not from the host's filesystem.Could you provide any further guidance on how to resolve this permission issue inside the container?
Thanks for your help.
@pirate commented on GitHub (Jul 23, 2025):
No need to run
archivebox setupin docker as the container already ships with all the dependencies preinstalled. You only needarchivebox init.@Guanchishan commented on GitHub (Jul 23, 2025):
Hi @pirate, thank you for your continued support and suggestions.
Following your advice, I have taken the following steps:
archivebox initon my existing data volume, which completed successfully.archivebox update.Unfortunately, the original issue persists. Even with a clean container and no custom paths in
ArchiveBox.conf, the update process still fails with the sameFileNotFoundError. The error messages indicate that ArchiveBox is still attempting to locate binaries in the old/node/...path, for example:This is puzzling, as I believed that removing the custom paths from the config and using a fresh container would resolve this. It seems that the old configuration is still being referenced from somewhere.
I have avoided running
archivebox setupas you advised. Could this old path information be cached somewhere else, perhaps within theindex.sqlite3database itself? Any guidance on where to look next would be greatly appreciated.@pirate commented on GitHub (Jul 31, 2025):
can you run
docker run -it -v $PWD:/data archivebox/archivebox:latest configanddocker run -it -v $PWD:/data archivebox/archivebox:latest versionagain and share the output@Guanchishan commented on GitHub (Aug 1, 2025):
docker run -it -v $PWD:/data archivebox/archivebox:latest config:docker run -it -v $PWD:/data archivebox/archivebox:latest version:P.s.: I am confused as to why it would have an error related to
/datalike this:Because my ArchiveBox site is actually accessible, I also tried running
archivebox init, and the result was:And then, I ran
docker run -it -v $PWD:/data archivebox/archivebox:latest config, the returned content is still the same:@pirate commented on GitHub (Aug 30, 2025):
What directory are in you in on the host?
$PWDstoresdata/in the current directory, so make sure it's a new empty dir somewhere.After the first
docker run -v $PWD:/data archivebox/archivebox:latest initinside the data dir, then on the host, runchmod -R 777 ., then all docker commands after that should be run from inside the data dir, or change$PWDto the absolute path.Also don't ignore that warning about not having space remaining, archivebox will refuse to launch if it's below ~50mb.
@pirate commented on GitHub (Jan 8, 2026):
devhas an overhauled dependency management system, so I'm going to close this as stale as none of the relevant code is the same anymore. please open a new issue for any problems you encounter ondev!It's not stable yet, so dont upgrade any big archive collections, but stay tuned for the next stable release with all the changes.