[GH-ISSUE #1618] Bug: v0.7.2 single-file not compatible with older node JS versions #2479

Closed
opened 2026-03-01 17:59:18 +03:00 by kerem · 1 comment
Owner

Originally created by @mmhussain-aankoop on GitHub (Dec 12, 2024).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1618

Originally assigned to: @pirate on GitHub.

Provide a screenshot and describe the bug

Hi, I am able to archive pages, just not with the singlefile extractor. When I run archivebox setup, the dependency is reported as installed:

 √  SINGLEFILE_BINARY     v1.1.54         valid     ./node_modules/single-file-cli/single-file

But when I try to archive a page, I get the error shown under "Logs or errors". I am also trying to archive only the singlefile, PDF, and screenshot types, but somehow it still runs wget, htmltotext, and archive_org:

> pdf
> screenshot
> wget
> htmltotext
> archive_org

Steps to reproduce

archivebox add 'https://some_url'

Logs or errors

> ./archive/1733996452.815462
      > singlefile
        Extractor failed:
             SingleFile was not able to archive the page
        Run to see full output:
            cd /home/archivebox/data1/archive/1733996452.815462;
            /home/archivebox/data1/node_modules/single-file-cli/single-file --browser-executable-path=/home/archivebox/.cache/ms-playwright/chromium-1148/chrome-linux/chrome "--browser-args=[\"--headless=new\", \"--user-agent=Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/118.0.0.0 Safari/537.36 ArchiveBox/0.7.2 (+https://github.com/ArchiveBox/ArchiveBox/)\", \"--window-size=1440,2000\"]" "https://www.ocu.org/reclamar" singlefile.html

ArchiveBox Version

0.7.2
ArchiveBox v0.7.2 BUILD_TIME=2024-12-11 18:08:22 1733936902
IN_DOCKER=False IN_QEMU=False ARCH=x86_64 OS=Linux PLATFORM=Linux-5.15.0-126-generic-x86_64-with-glibc2.35 PYTHON=Cpython
FS_ATOMIC=True FS_REMOTE=True FS_USER=1001:1001 FS_PERMS=644
DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND=ripgrep LDAP=False

[i] Dependency versions:
 √  PYTHON_BINARY         v3.10.12        valid     /usr/bin/python3.10
 √  SQLITE_BINARY         v2.6.0          valid     /usr/lib/python3.10/sqlite3/dbapi2.py
 √  DJANGO_BINARY         v3.1.14         valid     /usr/local/lib/python3.10/dist-packages/django/__init__.py
 √  ARCHIVEBOX_BINARY     v0.7.2          valid     /usr/local/bin/archivebox

 √  CURL_BINARY           v7.81.0         valid     /usr/bin/curl
 √  WGET_BINARY           v1.21.2         valid     /usr/bin/wget
 √  NODE_BINARY           v12.22.9        valid     /usr/bin/node
 √  SINGLEFILE_BINARY     v1.1.54         valid     ./node_modules/single-file-cli/single-file
 -  READABILITY_BINARY    -               disabled  ./node_modules/readability-extractor/readability-extractor
 -  MERCURY_BINARY        -               disabled  ./node_modules/@postlight/parser/cli.js
 √  GIT_BINARY            v2.34.1         valid     /usr/bin/git
 -  YOUTUBEDL_BINARY      -               disabled  /home/archivebox/.local/lib/python3.10/site-packages/youtube_dl/__main__.py
 √  CHROME_BINARY         v131.0.6778.33  valid     /home/archivebox/.cache/ms-playwright/chromium-1148/chrome-linux/chrome
 √  RIPGREP_BINARY        v13.0.0         valid     /usr/bin/rg

[i] Source-code locations:
 √  PACKAGE_DIR           23 files        valid     /usr/local/lib/python3.10/dist-packages/archivebox
 √  TEMPLATES_DIR         3 files         valid     /usr/local/lib/python3.10/dist-packages/archivebox/templates
 -  CUSTOM_TEMPLATES_DIR  -               disabled  None

[i] Secrets locations:
 -  CHROME_USER_DATA_DIR  -               disabled  None
 -  COOKIES_FILE          -               disabled  None

[i] Data locations:
 √  OUTPUT_DIR            11 files        valid     /home/archivebox/data1
 √  SOURCES_DIR           15 files        valid     ./sources
 √  LOGS_DIR              1 files         valid     ./logs
 √  ARCHIVE_DIR           89 files @      valid     /home/archivebox/data
 √  CONFIG_FILE           501.0 Bytes     valid     ./ArchiveBox.conf
 √  SQL_INDEX             664.0 KB        valid     ./index.sqlite3

How did you install the version of ArchiveBox you are using?

pip

What operating system are you running on?

Linux (Ubuntu/Debian/Arch/Alpine/etc.)

What type of drive are you using to store your ArchiveBox data?

  • [ ] data/ is on a local SSD or NVMe drive
  • [ ] data/ is on a spinning hard drive or external USB drive
  • [x] data/ is on a network mount (e.g. NFS/SMB/CIFS/etc.)
  • [ ] data/ is on a FUSE mount (e.g. SSHFS/RClone/S3/B2/OneDrive, etc.)

Docker Compose Configuration


ArchiveBox Configuration

[SERVER_CONFIG]
SECRET_KEY = xxxxxxxxxxxxxxxxxx
ARCHIVE_DIR = /home/archivebox/data
CONFIG_FILE = /home/archivebox/data1/ArchiveBox.conf
ARCHIVE_METHODS = singlefile,pdf,screenshot
[DEPENDENCY_CONFIG]
YOUTUBEDL_BINARY = /home/archivebox/.local/lib/python3.10/site-packages/youtube_dl/__main__.py
CHROME_BINARY = /home/archivebox/.cache/ms-playwright/chromium-1148/chrome-linux/chrome
kerem closed this issue 2026-03-01 17:59:19 +03:00

@mmhussain-aankoop commented on GitHub (Dec 12, 2024):

The issue was an incompatible Node.js version. I updated Node.js to version 20 and reinstalled single-file-cli, and now it works.
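
For anyone hitting the same error, the check behind this fix can be sketched as a small shell script. It compares the installed Node.js major version against a threshold before singlefile is used. The threshold of 20 is an assumption taken from the version that worked in this report, not a documented minimum for single-file-cli, and the helper names `node_major` and `node_is_new_enough` are hypothetical.

```shell
#!/bin/sh
# Assumption: 20 is the Node.js major version confirmed working in this issue,
# not an officially documented minimum for single-file-cli.
REQUIRED_NODE_MAJOR="${REQUIRED_NODE_MAJOR:-20}"

# Print the major version from a string such as "v12.22.9" (hypothetical helper).
node_major() {
    version="${1#v}"                  # drop the leading "v"
    printf '%s\n' "${version%%.*}"    # keep everything before the first dot
}

# Exit status 0 when the given version meets the required major (hypothetical helper).
node_is_new_enough() {
    [ "$(node_major "$1")" -ge "$REQUIRED_NODE_MAJOR" ]
}

if command -v node >/dev/null 2>&1; then
    current="$(node --version)"
    if node_is_new_enough "$current"; then
        echo "Node.js $current meets the assumed minimum of v$REQUIRED_NODE_MAJOR"
    else
        echo "Node.js $current is older than v$REQUIRED_NODE_MAJOR; upgrade Node.js, then reinstall single-file-cli"
    fi
else
    echo "node was not found on PATH"
fi
```

With the v12.22.9 shown in the dependency list above, this script would take the "older than" branch. After upgrading, rerunning archivebox setup should show whether SINGLEFILE_BINARY is still reported as valid.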
