[GH-ISSUE #1167] Bug: Youtube Download Issues (Videos don't play) #726

Closed
opened 2026-03-01 14:45:51 +03:00 by kerem · 5 comments
Owner

Originally created by @taylor-schneider on GitHub (Jun 27, 2023).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1167

Describe the bug

Downloading a youtube video shows complete with no errors. The problem is that the singlefile.html and the wget related resources do not play the video (it just hangs). And the capture did not grab comments for the video. The media page has a webm file that will play, but the controls on the player do not work (scrolling or drag and drop do not work).

Steps to reproduce

  1. Docker pull the latests image (5b8da5097f7f)
  2. Add a youtube url
  3. Launch the server and try to play the video

Screenshots or log output

Here are the logs from adding a video

[root@localhost archivebox_test]# docker run -v $PWD:/data archivebox/archivebox add 'https://www.youtube.com/watch?v=AAR_9mJ7rEU'
find: '/.config/chromium/Crash Reports/pending/': No such file or directory
[i] [2023-06-27 14:44:44] ArchiveBox v0.6.3: archivebox add https://www.youtube.com/watch?v=AAR_9mJ7rEU
    > /data

find: '/.config/chromium/Crash Reports/pending/': No such file or directory
[+] [2023-06-27 14:44:45] Adding 1 links to index (crawl depth=0)...
    > Saved verbatim input to sources/1687877085-import.txt
    > Parsed 1 URLs from input (Generic TXT)
    > Found 1 new URLs not already in index

[*] [2023-06-27 14:44:45] Writing 1 links to main index...
    √ ./index.sqlite3

[*] [2023-06-27 14:44:45] Archiving 1/2 URLs from added set...

[▶] [2023-06-27 14:44:45] Starting archiving of 1 snapshots in index...

[+] [2023-06-27 14:44:45] "www.youtube.com/watch?v=AAR_9mJ7rEU"
    https://www.youtube.com/watch?v=AAR_9mJ7rEU
    > ./archive/1687877085.323637
      > favicon
      > headers
      > singlefile
      > pdf
      > screenshot
      > dom
      > wget
      > title
      > readability
      > mercury
      > media
      > archive_org
        124 files (127.3 MB) in 0:01:31s

[√] [2023-06-27 14:46:16] Update of 1 pages complete (1.52 min)
    - 0 links skipped
    - 1 links updated
    - 0 links had errors

    Hint: To manage your archive in a Web UI, run:
        archivebox server 0.0.0.0:8000

Here are the logs from launching the server, opening the webpage, clicking on the wget page, and clicking play in the resulting page:

[root@localhost archivebox_test]# docker run -v $PWD:/data -p 8000:8000 archivebox/archivebox server 0.0.0.0:8000
find: '/.config/chromium/Crash Reports/pending/': No such file or directory
[i] [2023-06-27 14:47:16] ArchiveBox v0.6.3: archivebox server 0.0.0.0:8000
    > /data

find: '/.config/chromium/Crash Reports/pending/': No such file or directory
[+] Starting ArchiveBox webserver...
    > Logging errors to ./logs/errors.log
Performing system checks...

System check identified no issues (0 silenced).
June 27, 2023 - 14:47:17
Django version 3.1.14, using settings 'core.settings'
Starting development server at http://0.0.0.0:8000/
Quit the server with CONTROL-C.
"GET / HTTP/1.1" 302 0
"GET /public/ HTTP/1.1" 200 10434
"GET /archive/1687877085.323637/index.html HTTP/1.1" 200 241255
"GET /archive/1687877085.323637/screenshot.png HTTP/1.1" 200 1331187
"GET /archive/1687877085.323637/headers.json HTTP/1.1" 200 2145
"GET /archive/1687877085.323637/mercury/content.html HTTP/1.1" 200 1337
Not Found: /archive/1687877085.323637/git/
"GET /archive/1687877085.323637/git/ HTTP/1.1" 404 1232
"GET /archive/1687877085.323637/media/ HTTP/1.1" 200 1089
"GET /archive/1687877085.323637/singlefile.html HTTP/1.1" 200 1553534
"GET /archive/1687877085.323637/output.pdf HTTP/1.1" 200 9600
"GET /archive/1687877085.323637/www.youtube.com/watch%40v%3DAAR_9mJ7rEU.html HTTP/1.1" 200 852461
"GET /archive/1687877085.323637/readability/content.html HTTP/1.1" 200 12350
"GET /archive/1687877085.323637/output.html HTTP/1.1" 200 3653733
"GET /archive/1687877085.323637/www.youtube.com/s/player/b7910ca8/player_ias.vflset/en_US/base.js HTTP/1.1" 200 2401178
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js HTTP/1.1" 404 179
"GET /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js HTTP/1.1" 404 179
"GET /archive/1687877085.323637/fonts.googleapis.com/css2@family=Roboto%253Awght@300%3B400%3B500%3B700&family=YouTube+Sans%253Awght@300..900&display=swap.css HTTP/1.1" 200 11231
Not Found: /s/player/b7910ca8/www-player.css
"GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/captions.js
"GET /archive/1687877085.323637/www.youtube.com/s/player/b7910ca8/www-player.css HTTP/1.1" 200 418754
"GET /s/player/b7910ca8/player_ias.vflset/en_US/captions.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/remote.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/remote.js HTTP/1.1" 404 179
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/cssbin/www-main-desktop-watch-page-skeleton.css HTTP/1.1" 200 13318
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/offline.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/offline.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/base.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/base.js HTTP/1.1" 404 179
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/cssbin/www-main-desktop-player-skeleton.css HTTP/1.1" 200 2182
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/cssbin/www-onepick.css HTTP/1.1" 200 1011
"GET /archive/1687877085.323637/www.youtube.com/s/_/ytmainappweb/_/ss/k=ytmainappweb.kevlar_base.NqayTKgDV74.L.B1.O/am=AAU/d=0/rs=AGKMywEYRxwoIs7keRsZ6YWhrzQXac6QIg.css HTTP/1.1" 200 94108
"GET /archive/1687877085.323637/i.ytimg.com/vi/AAR_9mJ7rEU/hqdefault.jpg HTTP/1.1" 200 40621
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/web-animations-next-lite.min.vflset/web-animations-next-lite.min.js HTTP/1.1" 200 50695
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/custom-elements-es5-adapter.vflset/custom-elements-es5-adapter.js HTTP/1.1" 200 1980
Not Found: /s/player/b7910ca8/www-player.css
"GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/webcomponents-sd.vflset/webcomponents-sd.js HTTP/1.1" 200 78637
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/desktop_polymer.vflset/desktop_polymer.js HTTP/1.1" 200 9867053
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/intersection-observer.min.vflset/intersection-observer.min.js HTTP/1.1" 200 5547
Not Found: /s/search/audio/success.mp3
"GET /s/search/audio/success.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/open.mp3
"GET /s/search/audio/open.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/no_input.mp3
"GET /s/search/audio/no_input.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/failure.mp3
"GET /s/search/audio/failure.mp3 HTTP/1.1" 404 179
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/scheduler.vflset/scheduler.js HTTP/1.1" 200 9685
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/www-i18n-constants-en_US.vflset/www-i18n-constants.js HTTP/1.1" 200 5863
"GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOmCnqEu92Fr1Mu4mxKKTU1Kg.woff2 HTTP/1.1" 200 11028
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/spf.vflset/spf.js HTTP/1.1" 200 41678
"GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/network.vflset/network.js HTTP/1.1" 200 16626
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/offline.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/offline.js HTTP/1.1" 404 179
"GET /archive/1687877085.323637/accounts.google.com/ServiceLogin@service=youtube&uilel=3&passive=true&continue=https%253A%252F%252Fwww.youtube.com%252Fsignin%253Faction_handle_signin=true&app=desktop&hl=en&next=%252Fsignin_passive&feature=passive&hl=en.html HTTP/1.1" 200 583806
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/remote.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/remote.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js HTTP/1.1" 404 179
"GET /archive/1687877085.323637/www.gstatic.com/_/mss/boq-identity/_/js/k=boq-identity.AccountsSignInUi.en.62Pt1FXbous.es5.O/am=XxxgjkMPHMT_VOdnnBwGAAAAAAAAAABaGOYE/d=1/excm=_b,_r,_tp,identifierview/ed=1/dg=0/wt=2/ujg=1/rs=AOaEmlEPagU10zXs_96UZzjaythSSDf3Sw/m=_b,_tp,_ HTTP/1.1" 200 213179
Not Found: /youtubei/v1/att/get
"POST /youtubei/v1/att/get?key=REDACTED&prettyPrint=false HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/base.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/base.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/www-player.css
"GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179
Not Found: /s/search/audio/open.mp3
"GET /s/search/audio/open.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/success.mp3
Not Found: /s/search/audio/no_input.mp3
"GET /s/search/audio/no_input.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/failure.mp3
"GET /s/search/audio/failure.mp3 HTTP/1.1" 404 179
"GET /s/search/audio/success.mp3 HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/www-player.css
"GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179
"GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOlCnqEu92Fr1MmEU9fBBc4AMP6lQ.woff2 HTTP/1.1" 200 11072
"GET /archive/1687877085.323637/fonts.gstatic.com/s/youtubesans/v18/Qw38ZQNGEDjaO2m6tqIqX5E-AVS5_rSejo46_PCTRspJ0OosolrBEJL3HO_T7fHoCVHx.woff2 HTTP/1.1" 200 25052
Not Found: /youtubei/v1/feedback
"POST /youtubei/v1/feedback?key=REDACTED&prettyPrint=false HTTP/1.1" 404 179
"GET /archive/1687877085.323637/www.youtube.com/watch%40v%3DAAR_9mJ7rEU.html HTTP/1.1" 200 852461
Not Found: /youtubei/v1/log_event
"POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179
Not Found: /generate_204
"HEAD /generate_204 HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/offline.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/offline.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/remote.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/remote.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js HTTP/1.1" 404 179
Not Found: /youtubei/v1/att/get
"POST /youtubei/v1/att/get?key=REDACTED&prettyPrint=false HTTP/1.1" 404 179
Not Found: /youtubei/v1/log_event
Not Found: /s/search/audio/failure.mp3
"GET /s/search/audio/failure.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/open.mp3
"GET /s/search/audio/open.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/no_input.mp3
"GET /s/search/audio/no_input.mp3 HTTP/1.1" 404 179
Not Found: /s/search/audio/success.mp3
"GET /s/search/audio/success.mp3 HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/www-player.css
"GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179
"GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOmCnqEu92Fr1Mu7GxKKTU1Kvnz.woff2 HTTP/1.1" 200 7736
"GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOlCnqEu92Fr1MmEU9fChc4AMP6lbBP.woff2 HTTP/1.1" 200 7676
"POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179
Not Found: /youtubei/v1/log_event
Not Found: /generate_204
"HEAD /generate_204 HTTP/1.1" 404 179
"POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179
Not Found: /generate_204
"HEAD /generate_204 HTTP/1.1" 404 179
Not Found: /youtubei/v1/log_event
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js HTTP/1.1" 404 179
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/captions.js
Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js
"GET /s/player/b7910ca8/player_ias.vflset/en_US/captions.js HTTP/1.1" 404 179
"GET /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js HTTP/1.1" 404 179
"POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179
Not Found: /generate_204
"HEAD /generate_204 HTTP/1.1" 404 179

ArchiveBox version

[root@localhost archivebox_test]# docker run -v $PWD:/data -p 8000:8000 archivebox/archivebox version
find: '/.config/chromium/Crash Reports/pending/': No such file or directory
0.6.3
ArchiveBox v0.6.3 Cpython Linux Linux-3.10.0-1160.25.1.el7.x86_64-x86_64-with-glibc2.31 x86_64
DEBUG=False IN_DOCKER=True IS_TTY=False TZ=UTC FS_ATOMIC=True FS_REMOTE=True FS_PERMS=644 999:999 SEARCH_BACKEND=ripgrep

[i] Dependency versions:
 √  PYTHON_BINARY         v3.11.3         valid     /usr/local/bin/python3.11                                  
 √  SQLITE_BINARY         v2.6.0          valid     /usr/local/lib/python3.11/sqlite3/dbapi2.py                
 √  DJANGO_BINARY         v3.1.14         valid     /usr/local/lib/python3.11/site-packages/django/__init__.py 
 √  ARCHIVEBOX_BINARY     v0.6.3          valid     /usr/local/bin/archivebox                                  

 √  CURL_BINARY           v7.74.0         valid     /usr/bin/curl                                              
 √  WGET_BINARY           v1.21           valid     /usr/bin/wget                                              
 √  NODE_BINARY           v18.16.0        valid     /usr/bin/node                                              
 √  SINGLEFILE_BINARY     v0.3.16         valid     /node/node_modules/single-file/cli/single-file             
 √  READABILITY_BINARY    v0.0.2          valid     /node/node_modules/readability-extractor/readability-extractor
 √  MERCURY_BINARY        v1.0.0          valid     /node/node_modules/@postlight/mercury-parser/cli.js        
 √  GIT_BINARY            v2.30.2         valid     /usr/bin/git                                               
 √  YOUTUBEDL_BINARY      v2023.03.04     valid     /usr/local/bin/yt-dlp                                      
 √  CHROME_BINARY         v112.0.5615.138  valid     /usr/bin/chromium                                         
 √  RIPGREP_BINARY        v12.1.1         valid     /usr/bin/rg                                                

[i] Source-code locations:
 √  PACKAGE_DIR           22 files        valid     /app/archivebox                                            
 √  TEMPLATES_DIR         3 files         valid     /app/archivebox/templates                                  
 -  CUSTOM_TEMPLATES_DIR  -               disabled                                                             

[i] Secrets locations:
 -  CHROME_USER_DATA_DIR  -               disabled                                                             
 -  COOKIES_FILE          -               disabled                                                             

[i] Data locations:
 √  OUTPUT_DIR            5 files @       valid     /data                                                      
 √  SOURCES_DIR           2 files         valid     ./sources                                                  
 √  LOGS_DIR              1 files         valid     ./logs                                                     
 √  ARCHIVE_DIR           2 files         valid     ./archive                                                  
 √  CONFIG_FILE           81.0 Bytes      valid     ./ArchiveBox.conf                                          
 √  SQL_INDEX             220.0 KB        valid     ./index.sqlite3                                            

Originally created by @taylor-schneider on GitHub (Jun 27, 2023). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1167 <!-- Please fill out the following information, feel free to delete sections if they're not applicable or if long issue templates annoy you. (the only required section is the version information) --> #### Describe the bug Downloading a youtube video shows complete with no errors. The problem is that the singlefile.html and the wget related resources do not play the video (it just hangs). And the capture did not grab comments for the video. The media page has a webm file that will play, but the controls on the player do not work (scrolling or drag and drop do not work). #### Steps to reproduce 1. Docker pull the latests image (5b8da5097f7f) 2. Add a youtube url 3. Launch the server and try to play the video #### Screenshots or log output Here are the logs from adding a video ``` [root@localhost archivebox_test]# docker run -v $PWD:/data archivebox/archivebox add 'https://www.youtube.com/watch?v=AAR_9mJ7rEU' find: '/.config/chromium/Crash Reports/pending/': No such file or directory [i] [2023-06-27 14:44:44] ArchiveBox v0.6.3: archivebox add https://www.youtube.com/watch?v=AAR_9mJ7rEU > /data find: '/.config/chromium/Crash Reports/pending/': No such file or directory [+] [2023-06-27 14:44:45] Adding 1 links to index (crawl depth=0)... > Saved verbatim input to sources/1687877085-import.txt > Parsed 1 URLs from input (Generic TXT) > Found 1 new URLs not already in index [*] [2023-06-27 14:44:45] Writing 1 links to main index... √ ./index.sqlite3 [*] [2023-06-27 14:44:45] Archiving 1/2 URLs from added set... [▶] [2023-06-27 14:44:45] Starting archiving of 1 snapshots in index... [+] [2023-06-27 14:44:45] "www.youtube.com/watch?v=AAR_9mJ7rEU" https://www.youtube.com/watch?v=AAR_9mJ7rEU > ./archive/1687877085.323637 > favicon > headers > singlefile > pdf > screenshot > dom > wget > title > readability > mercury > media > archive_org 124 files (127.3 MB) in 0:01:31s [√] [2023-06-27 14:46:16] Update of 1 pages complete (1.52 min) - 0 links skipped - 1 links updated - 0 links had errors Hint: To manage your archive in a Web UI, run: archivebox server 0.0.0.0:8000 ``` Here are the logs from launching the server, opening the webpage, clicking on the wget page, and clicking play in the resulting page: ``` [root@localhost archivebox_test]# docker run -v $PWD:/data -p 8000:8000 archivebox/archivebox server 0.0.0.0:8000 find: '/.config/chromium/Crash Reports/pending/': No such file or directory [i] [2023-06-27 14:47:16] ArchiveBox v0.6.3: archivebox server 0.0.0.0:8000 > /data find: '/.config/chromium/Crash Reports/pending/': No such file or directory [+] Starting ArchiveBox webserver... > Logging errors to ./logs/errors.log Performing system checks... System check identified no issues (0 silenced). June 27, 2023 - 14:47:17 Django version 3.1.14, using settings 'core.settings' Starting development server at http://0.0.0.0:8000/ Quit the server with CONTROL-C. "GET / HTTP/1.1" 302 0 "GET /public/ HTTP/1.1" 200 10434 "GET /archive/1687877085.323637/index.html HTTP/1.1" 200 241255 "GET /archive/1687877085.323637/screenshot.png HTTP/1.1" 200 1331187 "GET /archive/1687877085.323637/headers.json HTTP/1.1" 200 2145 "GET /archive/1687877085.323637/mercury/content.html HTTP/1.1" 200 1337 Not Found: /archive/1687877085.323637/git/ "GET /archive/1687877085.323637/git/ HTTP/1.1" 404 1232 "GET /archive/1687877085.323637/media/ HTTP/1.1" 200 1089 "GET /archive/1687877085.323637/singlefile.html HTTP/1.1" 200 1553534 "GET /archive/1687877085.323637/output.pdf HTTP/1.1" 200 9600 "GET /archive/1687877085.323637/www.youtube.com/watch%40v%3DAAR_9mJ7rEU.html HTTP/1.1" 200 852461 "GET /archive/1687877085.323637/readability/content.html HTTP/1.1" 200 12350 "GET /archive/1687877085.323637/output.html HTTP/1.1" 200 3653733 "GET /archive/1687877085.323637/www.youtube.com/s/player/b7910ca8/player_ias.vflset/en_US/base.js HTTP/1.1" 200 2401178 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js HTTP/1.1" 404 179 "GET /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js HTTP/1.1" 404 179 "GET /archive/1687877085.323637/fonts.googleapis.com/css2@family=Roboto%253Awght@300%3B400%3B500%3B700&family=YouTube+Sans%253Awght@300..900&display=swap.css HTTP/1.1" 200 11231 Not Found: /s/player/b7910ca8/www-player.css "GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/captions.js "GET /archive/1687877085.323637/www.youtube.com/s/player/b7910ca8/www-player.css HTTP/1.1" 200 418754 "GET /s/player/b7910ca8/player_ias.vflset/en_US/captions.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/remote.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/remote.js HTTP/1.1" 404 179 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/cssbin/www-main-desktop-watch-page-skeleton.css HTTP/1.1" 200 13318 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/offline.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/offline.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/base.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/base.js HTTP/1.1" 404 179 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/cssbin/www-main-desktop-player-skeleton.css HTTP/1.1" 200 2182 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/cssbin/www-onepick.css HTTP/1.1" 200 1011 "GET /archive/1687877085.323637/www.youtube.com/s/_/ytmainappweb/_/ss/k=ytmainappweb.kevlar_base.NqayTKgDV74.L.B1.O/am=AAU/d=0/rs=AGKMywEYRxwoIs7keRsZ6YWhrzQXac6QIg.css HTTP/1.1" 200 94108 "GET /archive/1687877085.323637/i.ytimg.com/vi/AAR_9mJ7rEU/hqdefault.jpg HTTP/1.1" 200 40621 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/web-animations-next-lite.min.vflset/web-animations-next-lite.min.js HTTP/1.1" 200 50695 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/custom-elements-es5-adapter.vflset/custom-elements-es5-adapter.js HTTP/1.1" 200 1980 Not Found: /s/player/b7910ca8/www-player.css "GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/webcomponents-sd.vflset/webcomponents-sd.js HTTP/1.1" 200 78637 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/desktop_polymer.vflset/desktop_polymer.js HTTP/1.1" 200 9867053 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/intersection-observer.min.vflset/intersection-observer.min.js HTTP/1.1" 200 5547 Not Found: /s/search/audio/success.mp3 "GET /s/search/audio/success.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/open.mp3 "GET /s/search/audio/open.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/no_input.mp3 "GET /s/search/audio/no_input.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/failure.mp3 "GET /s/search/audio/failure.mp3 HTTP/1.1" 404 179 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/scheduler.vflset/scheduler.js HTTP/1.1" 200 9685 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/www-i18n-constants-en_US.vflset/www-i18n-constants.js HTTP/1.1" 200 5863 "GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOmCnqEu92Fr1Mu4mxKKTU1Kg.woff2 HTTP/1.1" 200 11028 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/spf.vflset/spf.js HTTP/1.1" 200 41678 "GET /archive/1687877085.323637/www.youtube.com/s/desktop/0e9d1cf9/jsbin/network.vflset/network.js HTTP/1.1" 200 16626 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/offline.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/offline.js HTTP/1.1" 404 179 "GET /archive/1687877085.323637/accounts.google.com/ServiceLogin@service=youtube&uilel=3&passive=true&continue=https%253A%252F%252Fwww.youtube.com%252Fsignin%253Faction_handle_signin=true&app=desktop&hl=en&next=%252Fsignin_passive&feature=passive&hl=en.html HTTP/1.1" 200 583806 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/remote.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/remote.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js HTTP/1.1" 404 179 "GET /archive/1687877085.323637/www.gstatic.com/_/mss/boq-identity/_/js/k=boq-identity.AccountsSignInUi.en.62Pt1FXbous.es5.O/am=XxxgjkMPHMT_VOdnnBwGAAAAAAAAAABaGOYE/d=1/excm=_b,_r,_tp,identifierview/ed=1/dg=0/wt=2/ujg=1/rs=AOaEmlEPagU10zXs_96UZzjaythSSDf3Sw/m=_b,_tp,_ HTTP/1.1" 200 213179 Not Found: /youtubei/v1/att/get "POST /youtubei/v1/att/get?key=REDACTED&prettyPrint=false HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/base.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/base.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/www-player.css "GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179 Not Found: /s/search/audio/open.mp3 "GET /s/search/audio/open.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/success.mp3 Not Found: /s/search/audio/no_input.mp3 "GET /s/search/audio/no_input.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/failure.mp3 "GET /s/search/audio/failure.mp3 HTTP/1.1" 404 179 "GET /s/search/audio/success.mp3 HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/www-player.css "GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179 "GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOlCnqEu92Fr1MmEU9fBBc4AMP6lQ.woff2 HTTP/1.1" 200 11072 "GET /archive/1687877085.323637/fonts.gstatic.com/s/youtubesans/v18/Qw38ZQNGEDjaO2m6tqIqX5E-AVS5_rSejo46_PCTRspJ0OosolrBEJL3HO_T7fHoCVHx.woff2 HTTP/1.1" 200 25052 Not Found: /youtubei/v1/feedback "POST /youtubei/v1/feedback?key=REDACTED&prettyPrint=false HTTP/1.1" 404 179 "GET /archive/1687877085.323637/www.youtube.com/watch%40v%3DAAR_9mJ7rEU.html HTTP/1.1" 200 852461 Not Found: /youtubei/v1/log_event "POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179 Not Found: /generate_204 "HEAD /generate_204 HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/offline.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/offline.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/remote.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/remote.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/miniplayer.js HTTP/1.1" 404 179 Not Found: /youtubei/v1/att/get "POST /youtubei/v1/att/get?key=REDACTED&prettyPrint=false HTTP/1.1" 404 179 Not Found: /youtubei/v1/log_event Not Found: /s/search/audio/failure.mp3 "GET /s/search/audio/failure.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/open.mp3 "GET /s/search/audio/open.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/no_input.mp3 "GET /s/search/audio/no_input.mp3 HTTP/1.1" 404 179 Not Found: /s/search/audio/success.mp3 "GET /s/search/audio/success.mp3 HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/www-player.css "GET /s/player/b7910ca8/www-player.css HTTP/1.1" 404 179 "GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOmCnqEu92Fr1Mu7GxKKTU1Kvnz.woff2 HTTP/1.1" 200 7736 "GET /archive/1687877085.323637/fonts.gstatic.com/s/roboto/v30/KFOlCnqEu92Fr1MmEU9fChc4AMP6lbBP.woff2 HTTP/1.1" 200 7676 "POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179 Not Found: /youtubei/v1/log_event Not Found: /generate_204 "HEAD /generate_204 HTTP/1.1" 404 179 "POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179 Not Found: /generate_204 "HEAD /generate_204 HTTP/1.1" 404 179 Not Found: /youtubei/v1/log_event Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/annotations_module.js HTTP/1.1" 404 179 Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/captions.js Not Found: /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js "GET /s/player/b7910ca8/player_ias.vflset/en_US/captions.js HTTP/1.1" 404 179 "GET /s/player/b7910ca8/player_ias.vflset/en_US/endscreen.js HTTP/1.1" 404 179 "POST /youtubei/v1/log_event?alt=json&key=REDACTED HTTP/1.1" 404 179 Not Found: /generate_204 "HEAD /generate_204 HTTP/1.1" 404 179 ``` #### ArchiveBox version ```logs [root@localhost archivebox_test]# docker run -v $PWD:/data -p 8000:8000 archivebox/archivebox version find: '/.config/chromium/Crash Reports/pending/': No such file or directory 0.6.3 ArchiveBox v0.6.3 Cpython Linux Linux-3.10.0-1160.25.1.el7.x86_64-x86_64-with-glibc2.31 x86_64 DEBUG=False IN_DOCKER=True IS_TTY=False TZ=UTC FS_ATOMIC=True FS_REMOTE=True FS_PERMS=644 999:999 SEARCH_BACKEND=ripgrep [i] Dependency versions: √ PYTHON_BINARY v3.11.3 valid /usr/local/bin/python3.11 √ SQLITE_BINARY v2.6.0 valid /usr/local/lib/python3.11/sqlite3/dbapi2.py √ DJANGO_BINARY v3.1.14 valid /usr/local/lib/python3.11/site-packages/django/__init__.py √ ARCHIVEBOX_BINARY v0.6.3 valid /usr/local/bin/archivebox √ CURL_BINARY v7.74.0 valid /usr/bin/curl √ WGET_BINARY v1.21 valid /usr/bin/wget √ NODE_BINARY v18.16.0 valid /usr/bin/node √ SINGLEFILE_BINARY v0.3.16 valid /node/node_modules/single-file/cli/single-file √ READABILITY_BINARY v0.0.2 valid /node/node_modules/readability-extractor/readability-extractor √ MERCURY_BINARY v1.0.0 valid /node/node_modules/@postlight/mercury-parser/cli.js √ GIT_BINARY v2.30.2 valid /usr/bin/git √ YOUTUBEDL_BINARY v2023.03.04 valid /usr/local/bin/yt-dlp √ CHROME_BINARY v112.0.5615.138 valid /usr/bin/chromium √ RIPGREP_BINARY v12.1.1 valid /usr/bin/rg [i] Source-code locations: √ PACKAGE_DIR 22 files valid /app/archivebox √ TEMPLATES_DIR 3 files valid /app/archivebox/templates - CUSTOM_TEMPLATES_DIR - disabled [i] Secrets locations: - CHROME_USER_DATA_DIR - disabled - COOKIES_FILE - disabled [i] Data locations: √ OUTPUT_DIR 5 files @ valid /data √ SOURCES_DIR 2 files valid ./sources √ LOGS_DIR 1 files valid ./logs √ ARCHIVE_DIR 2 files valid ./archive √ CONFIG_FILE 81.0 Bytes valid ./ArchiveBox.conf √ SQL_INDEX 220.0 KB valid ./index.sqlite3 ```
kerem closed this issue 2026-03-01 14:45:51 +03:00
Author
Owner

@pirate commented on GitHub (Jun 28, 2023):

This is expected behavior, singlefile does not run the video player js / rewrite it to request from your local archive, so most non-native players don't work in singlefile archives, but you can go to the ./archive/<timestmap>/media folder to find the webm/mp4/mp3 media files extracted by youtube-dl/yt-dlp.

If you need higher fidelity archiving with inline videos working embedded in the archived page, I recommend ArchiveWeb.page / WebRecorder.net instead. (which we'd love to integrate with in the future)

<!-- gh-comment-id:1610989234 --> @pirate commented on GitHub (Jun 28, 2023): This is expected behavior, singlefile does not run the video player js / rewrite it to request from your local archive, so most non-native players don't work in singlefile archives, but you can go to the `./archive/<timestmap>/media` folder to find the webm/mp4/mp3 media files extracted by youtube-dl/yt-dlp. If you need higher fidelity archiving with inline videos working embedded in the archived page, I recommend ArchiveWeb.page / WebRecorder.net instead. (which we'd love to integrate with in the future)
Author
Owner

@melyux commented on GitHub (Jul 12, 2023):

It would be quite rad to have the embedded videos actually work on the Wget downloads at least, and Singlefile. Right now the media just sits in the folder but is not accessible.

<!-- gh-comment-id:1633104429 --> @melyux commented on GitHub (Jul 12, 2023): It would be quite rad to have the embedded videos actually work on the Wget downloads at least, and Singlefile. Right now the media just sits in the folder but is not accessible.
Author
Owner

@pirate commented on GitHub (Jul 18, 2023):

You should be able to click the files in the folder listing displayed in the media area of the snapshot details page and your browser will play the videos. Embedding the video back in the original page is beyond the scope of what I'm willing to implement.

<!-- gh-comment-id:1641125874 --> @pirate commented on GitHub (Jul 18, 2023): You should be able to click the files in the folder listing displayed in the media area of the snapshot details page and your browser will play the videos. Embedding the video back in the original page is beyond the scope of what I'm willing to implement.
Author
Owner

@melyux commented on GitHub (Jul 19, 2023):

I get it. One of the advantages of this project over Web Archive is the lack of space and rate limiting constraints, so it makes it actually possible to archive a page completely, including embedded videos/audios, without relying on outside resources. Could be a cool thing to have

<!-- gh-comment-id:1641572932 --> @melyux commented on GitHub (Jul 19, 2023): I get it. One of the advantages of this project over Web Archive is the lack of space and rate limiting constraints, so it makes it actually possible to archive a page completely, including embedded videos/audios, without relying on outside resources. Could be a cool thing to have
Author
Owner

@pirate commented on GitHub (Oct 19, 2023):

Embedded videos do work with my friends projects here, you should check them out if you need that: https://webrecorder.net/

They did an incredible job with high fidelity preservation, so you can even replay YouTube right in the native player interface. Might integrate with their engine someday but for now I'm focusing more on "digesting content" out of pages (slightly easier) instead of preserving it perfectly in its original format (a much harder problem).

<!-- gh-comment-id:1771493370 --> @pirate commented on GitHub (Oct 19, 2023): Embedded videos do work with my friends projects here, you should check them out if you need that: https://webrecorder.net/ They did an incredible job with high fidelity preservation, so you can even replay YouTube right in the native player interface. Might integrate with their engine someday but for now I'm focusing more on "digesting content" out of pages (slightly easier) instead of preserving it perfectly in its original format (a much harder problem).
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#726
No description provided.