[GH-ISSUE #758] Bug: Import from file fails when using the new 'docker compose' #480

Closed
opened 2026-03-01 14:44:00 +03:00 by kerem · 1 comment
Owner

Originally created by @mcolsen on GitHub (May 30, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/758

Describe the bug

Importing a list of URLs via stdin fails when using the new unhyphenated docker compose command.

Steps to reproduce

docker compose run archivebox add < /path/to/file.txt

Screenshots or log output

[+] [2021-05-30 21:10:56] Adding 0 links to index (crawl depth=0) (index only)...
    > Saved verbatim input to sources/1622409056-import.txt
    > Parsed 0 URLs from input (Failed to parse)
    > Found 0 new URLs not already in index

[*] [2021-05-30 21:10:56] Writing 0 links to main index...
    √ ./index.sqlite3

docker-compose run archivebox add < /path/to/file.txt works as expected.

ArchiveBox version

ArchiveBox v0.6.2
Cpython Linux Linux-5.10.25-linuxkit-x86_64-with-glibc2.28 x86_64
IN_DOCKER=True DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND_ENGINE=sonic

[i] Dependency versions:
 √  ARCHIVEBOX_BINARY     v0.6.2          valid     /usr/local/bin/archivebox
 √  PYTHON_BINARY         v3.9.5          valid     /usr/local/bin/python3.9
 √  DJANGO_BINARY         v3.1.10         valid     /usr/local/lib/python3.9/site-packages/django/bin/django-admin.py
 √  CURL_BINARY           v7.64.0         valid     /usr/bin/curl
 √  WGET_BINARY           v1.20.1         valid     /usr/bin/wget
 √  NODE_BINARY           v15.14.0        valid     /usr/bin/node
 √  SINGLEFILE_BINARY     v0.3.16         valid     /node/node_modules/single-file/cli/single-file
 √  READABILITY_BINARY    v0.0.2          valid     /node/node_modules/readability-extractor/readability-extractor
 √  MERCURY_BINARY        v1.0.0          valid     /node/node_modules/@postlight/mercury-parser/cli.js
 √  GIT_BINARY            v2.20.1         valid     /usr/bin/git
 √  YOUTUBEDL_BINARY      v2021.04.26     valid     /usr/local/bin/youtube-dl
 √  CHROME_BINARY         v90.0.4430.93   valid     /usr/bin/chromium
 √  RIPGREP_BINARY        v0.10.0         valid     /usr/bin/rg

[i] Source-code locations:
 √  PACKAGE_DIR           22 files        valid     /app/archivebox
 √  TEMPLATES_DIR         3 files         valid     /app/archivebox/templates
 -  CUSTOM_TEMPLATES_DIR  -               disabled

[i] Secrets locations:
 -  CHROME_USER_DATA_DIR  -               disabled
 -  COOKIES_FILE          -               disabled

[i] Data locations:
 √  OUTPUT_DIR            9 files         valid     /data
 √  SOURCES_DIR           40 files        valid     ./sources
 √  LOGS_DIR              1 files         valid     ./logs
 √  ARCHIVE_DIR           1268 files      valid     ./archive
 √  CONFIG_FILE           81.0 Bytes      valid     ./ArchiveBox.conf
 √  SQL_INDEX             11.4 MB         valid     ./index.sqlite3
Originally created by @mcolsen on GitHub (May 30, 2021). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/758 #### Describe the bug Importing a list of URLs via stdin fails when using the new unhyphenated `docker compose` command. #### Steps to reproduce `docker compose run archivebox add < /path/to/file.txt` #### Screenshots or log output ```logs [+] [2021-05-30 21:10:56] Adding 0 links to index (crawl depth=0) (index only)... > Saved verbatim input to sources/1622409056-import.txt > Parsed 0 URLs from input (Failed to parse) > Found 0 new URLs not already in index [*] [2021-05-30 21:10:56] Writing 0 links to main index... √ ./index.sqlite3 ``` `docker-compose run archivebox add < /path/to/file.txt` works as expected. #### ArchiveBox version ```logs ArchiveBox v0.6.2 Cpython Linux Linux-5.10.25-linuxkit-x86_64-with-glibc2.28 x86_64 IN_DOCKER=True DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND_ENGINE=sonic [i] Dependency versions: √ ARCHIVEBOX_BINARY v0.6.2 valid /usr/local/bin/archivebox √ PYTHON_BINARY v3.9.5 valid /usr/local/bin/python3.9 √ DJANGO_BINARY v3.1.10 valid /usr/local/lib/python3.9/site-packages/django/bin/django-admin.py √ CURL_BINARY v7.64.0 valid /usr/bin/curl √ WGET_BINARY v1.20.1 valid /usr/bin/wget √ NODE_BINARY v15.14.0 valid /usr/bin/node √ SINGLEFILE_BINARY v0.3.16 valid /node/node_modules/single-file/cli/single-file √ READABILITY_BINARY v0.0.2 valid /node/node_modules/readability-extractor/readability-extractor √ MERCURY_BINARY v1.0.0 valid /node/node_modules/@postlight/mercury-parser/cli.js √ GIT_BINARY v2.20.1 valid /usr/bin/git √ YOUTUBEDL_BINARY v2021.04.26 valid /usr/local/bin/youtube-dl √ CHROME_BINARY v90.0.4430.93 valid /usr/bin/chromium √ RIPGREP_BINARY v0.10.0 valid /usr/bin/rg [i] Source-code locations: √ PACKAGE_DIR 22 files valid /app/archivebox √ TEMPLATES_DIR 3 files valid /app/archivebox/templates - CUSTOM_TEMPLATES_DIR - disabled [i] Secrets locations: - CHROME_USER_DATA_DIR - disabled - COOKIES_FILE - disabled [i] Data locations: √ OUTPUT_DIR 9 files valid /data √ SOURCES_DIR 40 files valid ./sources √ LOGS_DIR 1 files valid ./logs √ ARCHIVE_DIR 1268 files valid ./archive √ CONFIG_FILE 81.0 Bytes valid ./ArchiveBox.conf √ SQL_INDEX 11.4 MB valid ./index.sqlite3 ```
kerem closed this issue 2026-03-01 14:44:00 +03:00
Author
Owner

@pirate commented on GitHub (May 31, 2021):

You should report the bug of stdin not being passed (+ breaking of backwards compatibility with docker-compose) over on the Docker compose repo instead: https://github.com/docker/compose/issues/new/choose

It's not something ArchiveBox is able to control.

In the meantime, use the pip3 install docker-compose / docker-compose version instead.

<!-- gh-comment-id:851148630 --> @pirate commented on GitHub (May 31, 2021): You should report the bug of stdin not being passed (+ breaking of backwards compatibility with `docker-compose`) over on the Docker compose repo instead: https://github.com/docker/compose/issues/new/choose It's not something ArchiveBox is able to control. In the meantime, use the `pip3 install docker-compose` / `docker-compose` version instead.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#480
No description provided.