[GH-ISSUE #677] Question: Unable to add bare domain without http:// or https:// in front #425

Closed
opened 2026-03-01 14:43:28 +03:00 by kerem · 1 comment
Owner

Originally created by @Leontking on GitHub (Mar 27, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/677

PS E:\ArchiveBox> archivebox add 'youtube.com'
[i] [2021-03-27 04:36:40] ArchiveBox v0.5.4: archivebox add youtube.com
    > E:\ArchiveBox

[!] Warning: Missing 6 recommended dependencies
    ! WGET_BINARY: wget (unable to detect version)
    ! SINGLEFILE_BINARY: single-file (unable to detect version)
      Hint: npm install --prefix . "git+https://github.com/ArchiveBox/ArchiveBox.git"
            or archivebox config --set SAVE_SINGLEFILE=False to silence this warning

    ! READABILITY_BINARY: readability-extractor (unable to detect version)
      Hint: npm install --prefix . "git+https://github.com/ArchiveBox/ArchiveBox.git"
            or archivebox config --set SAVE_READABILITY=False to silence this warning

    ! MERCURY_BINARY: mercury-parser (unable to detect version)
      Hint: npm install --prefix . "git+https://github.com/ArchiveBox/ArchiveBox.git"
            or archivebox config --set SAVE_MERCURY=False to silence this warning

    ! CHROME_BINARY: unable to find binary (unable to detect version)
    ! RIPGREP_BINARY: rg (unable to detect version)

[+] [2021-03-27 04:36:44] Adding 1 links to index (crawl depth=0)...
    > Saved verbatim input to sources/E:\ArchiveBox\sources\1616819804-import.txt
    > Parsed 0 URLs from input (Failed to parse)
    > Found 0 new URLs not already in index

[*] [2021-03-27 04:36:44] Writing 0 links to main index...
    √ E:\ArchiveBox\index.sqlite3
Originally created by @Leontking on GitHub (Mar 27, 2021). Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/677 ```batch PS E:\ArchiveBox> archivebox add 'youtube.com' [i] [2021-03-27 04:36:40] ArchiveBox v0.5.4: archivebox add youtube.com > E:\ArchiveBox [!] Warning: Missing 6 recommended dependencies ! WGET_BINARY: wget (unable to detect version) ! SINGLEFILE_BINARY: single-file (unable to detect version) Hint: npm install --prefix . "git+https://github.com/ArchiveBox/ArchiveBox.git" or archivebox config --set SAVE_SINGLEFILE=False to silence this warning ! READABILITY_BINARY: readability-extractor (unable to detect version) Hint: npm install --prefix . "git+https://github.com/ArchiveBox/ArchiveBox.git" or archivebox config --set SAVE_READABILITY=False to silence this warning ! MERCURY_BINARY: mercury-parser (unable to detect version) Hint: npm install --prefix . "git+https://github.com/ArchiveBox/ArchiveBox.git" or archivebox config --set SAVE_MERCURY=False to silence this warning ! CHROME_BINARY: unable to find binary (unable to detect version) ! RIPGREP_BINARY: rg (unable to detect version) [+] [2021-03-27 04:36:44] Adding 1 links to index (crawl depth=0)... > Saved verbatim input to sources/E:\ArchiveBox\sources\1616819804-import.txt > Parsed 0 URLs from input (Failed to parse) > Found 0 new URLs not already in index [*] [2021-03-27 04:36:44] Writing 0 links to main index... √ E:\ArchiveBox\index.sqlite3 ```
kerem 2026-03-01 14:43:28 +03:00
Author
Owner

@pirate commented on GitHub (Mar 27, 2021):

youtube.com is not a url, https://youtube.com is a url.

Note this line in the output Failed to parse:

 > Parsed 0 URLs from input (Failed to parse)

https://developer.mozilla.org/en-US/docs/Learn/Common_questions/What_is_a_URL#basics_anatomy_of_a_url

You must add fully qualified URLs, bare domains are not enough.

<!-- gh-comment-id:808650588 --> @pirate commented on GitHub (Mar 27, 2021): `youtube.com` is not a url, `https://youtube.com` is a url. Note this line in the output `Failed to parse`: ```logs > Parsed 0 URLs from input (Failed to parse) ``` https://developer.mozilla.org/en-US/docs/Learn/Common_questions/What_is_a_URL#basics_anatomy_of_a_url You must add fully qualified URLs, bare domains are not enough.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#425
No description provided.