[GH-ISSUE #1037] Bug: timeout 1m apply all task #2159

Closed
opened 2026-03-01 17:56:57 +03:00 by kerem · 2 comments
Owner

Originally created by @green1052 on GitHub (Oct 13, 2022).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1037

Describe the bug

After 1 minute (the default TIMEOUT value), all SingleFile tasks fail.

Steps to reproduce

1. Add long URLs.
2. Start the task.
3. After 1 minute (the default TIMEOUT value), all SingleFile tasks fail.

Screenshots or log output

SingleFile was not able to archive the page

ArchiveBox version

ArchiveBox v0.6.2
Cpython Linux Linux-5.15.0-1019-oracle-aarch64-with-glibc2.35 aarch64
IN_DOCKER=False DEBUG=False IS_TTY=True TZ=UTC SEARCH_BACKEND_ENGINE=ripgrep

[i] Dependency versions:
 √  ARCHIVEBOX_BINARY     v0.6.2          valid     /usr/local/bin/archivebox   
 √  PYTHON_BINARY         v3.10.6         valid     /usr/bin/python3.10         
 √  DJANGO_BINARY         v3.1.14         valid     /usr/local/lib/python3.10/dist-packages/django/bin/django-admin.py
 √  CURL_BINARY           v7.81.0         valid     /usr/bin/curl               
 √  WGET_BINARY           v1.21.2         valid     /usr/bin/wget               
 √  NODE_BINARY           v12.22.9        valid     /usr/bin/node               
 √  SINGLEFILE_BINARY     v1.0.13         valid     ./node_modules/single-file/cli/single-file
 -  READABILITY_BINARY    -               disabled  ./node_modules/readability-extractor/readability-extractor
 -  MERCURY_BINARY        -               disabled  ./node_modules/@postlight/mercury-parser/cli.js
 -  GIT_BINARY            -               disabled  /usr/bin/git                
 √  YOUTUBEDL_BINARY      v2021.12.17     valid     /usr/local/bin/youtube-dl   
 √  CHROME_BINARY         v106.0.5249.91  valid     /usr/bin/chromium-browser   
 √  RIPGREP_BINARY        v13.0.0         valid     /usr/bin/rg                 

[i] Source-code locations:
 √  PACKAGE_DIR           23 files        valid     /usr/local/lib/python3.10/dist-packages/archivebox
 √  TEMPLATES_DIR         3 files         valid     /usr/local/lib/python3.10/dist-packages/archivebox/templates
 -  CUSTOM_TEMPLATES_DIR  -               disabled                              

[i] Secrets locations:
 √  CHROME_USER_DATA_DIR  32 files        valid     ./chrome-profile            
 -  COOKIES_FILE          -               disabled                              

[i] Data locations:
 √  OUTPUT_DIR            9 files         valid     /home/ubuntu/archivebox     
 √  SOURCES_DIR           3 files         valid     ./sources                   
 √  LOGS_DIR              1 files         valid     ./logs                      
 √  ARCHIVE_DIR           668 files       valid     ./archive                   
 √  CONFIG_FILE           685.0 Bytes     valid     ./ArchiveBox.conf           
 √  SQL_INDEX             24.7 MB         valid     ./index.sqlite3             


kerem closed this issue 2026-03-01 17:56:57 +03:00

@green1052 commented on GitHub (Oct 13, 2022):

But the CLI works fine.


@pirate commented on GitHub (Oct 27, 2022):

Can you post the output from ./logs/* when the web-submitted tasks fail?
