[GH-ISSUE #720] Error when fetching every page: Failed to fetch link content ... #467

Closed
opened 2026-03-02 11:50:06 +03:00 by kerem · 10 comments
Owner

Originally created by @Deus-Glen on GitHub (Dec 7, 2024).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/720

Describe the Bug

When fetching each page, the following error is displayed after a short load:
Failed to fetch link content ...

Steps to Reproduce

  1. Paste a website link, and prees "save"

Expected Behaviour

get the content of link

Screenshots or Additional Context

my docker compose file:
image

logs of chrome:
image

Device Details

131.0.6778.109 64bit on Windows11

Exact Hoarder Version

Hoarder v0.19.0

Originally created by @Deus-Glen on GitHub (Dec 7, 2024). Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/720 ### Describe the Bug When fetching each page, the following error is displayed after a short load: Failed to fetch link content ... ### Steps to Reproduce 1. Paste a website link, and prees "save" ### Expected Behaviour get the content of link ### Screenshots or Additional Context my docker compose file: ![image](https://github.com/user-attachments/assets/fcaa1f2a-4fc9-42e3-9eb0-e4cc1058bd17) logs of chrome: ![image](https://github.com/user-attachments/assets/c3e77bbd-49a6-4ce7-ad8d-94224339198b) ### Device Details 131.0.6778.109 64bit on Windows11 ### Exact Hoarder Version Hoarder v0.19.0
kerem closed this issue 2026-03-02 11:50:07 +03:00
Author
Owner

@MohamedBassem commented on GitHub (Dec 7, 2024):

can you share the logs from the web container?

<!-- gh-comment-id:2525069572 --> @MohamedBassem commented on GitHub (Dec 7, 2024): can you share the logs from the web container?
Author
Owner

@Deus-Glen commented on GitHub (Dec 7, 2024):

like this
image

<!-- gh-comment-id:2525070122 --> @Deus-Glen commented on GitHub (Dec 7, 2024): like this ![image](https://github.com/user-attachments/assets/1f9d07ec-b760-45f8-bb0f-592fe1c26ee8)
Author
Owner

@MohamedBassem commented on GitHub (Dec 7, 2024):

yes, can you add a link and share what gets logged there after adding the link?

<!-- gh-comment-id:2525070497 --> @MohamedBassem commented on GitHub (Dec 7, 2024): yes, can you add a link and share what gets logged there after adding the link?
Author
Owner

@Deus-Glen commented on GitHub (Dec 7, 2024):

the chrome container change like this:
image

no change in the web container

<!-- gh-comment-id:2525071689 --> @Deus-Glen commented on GitHub (Dec 7, 2024): the chrome container change like this: ![image](https://github.com/user-attachments/assets/ac7af153-43c3-4cd5-ac6e-693a50a4a150) no change in the web container
Author
Owner

@MohamedBassem commented on GitHub (Dec 7, 2024):

the chrome container doesn't matter. But I'm surprised that the web container doesn't change at all? Are you sure?

<!-- gh-comment-id:2525072183 --> @MohamedBassem commented on GitHub (Dec 7, 2024): the chrome container doesn't matter. But I'm surprised that the web container doesn't change at all? Are you sure?
Author
Owner

@Deus-Glen commented on GitHub (Dec 7, 2024):

command:docker logs hoarder_web_1
it only changes when I delete the bookmark.
image

<!-- gh-comment-id:2525072854 --> @Deus-Glen commented on GitHub (Dec 7, 2024): command:docker logs hoarder_web_1 it only changes when I delete the bookmark. ![image](https://github.com/user-attachments/assets/974bf0ad-86b7-48f1-859a-fb27dd3e6995)
Author
Owner

@MohamedBassem commented on GitHub (Dec 7, 2024):

Ah, I think I know what's going on. Can you try the nightly version? I landed some fixes there that clarifies the problem. Change hoarder version to "latest", repull the images and try again

<!-- gh-comment-id:2525073426 --> @MohamedBassem commented on GitHub (Dec 7, 2024): Ah, I think I know what's going on. Can you try the nightly version? I landed some fixes there that clarifies the problem. Change hoarder version to "latest", repull the images and try again
Author
Owner

@Deus-Glen commented on GitHub (Dec 7, 2024):

I change the hoarder version to "latest" ,repull the images and try again, but it seems work the same, no change happens.
new log for web container:
image

<!-- gh-comment-id:2525076826 --> @Deus-Glen commented on GitHub (Dec 7, 2024): I change the hoarder version to "latest" ,repull the images and try again, but it seems work the same, no change happens. new log for web container: ![image](https://github.com/user-attachments/assets/6131b18d-e774-4254-b921-6a99594a8b4f)
Author
Owner

@Deus-Glen commented on GitHub (Dec 8, 2024):

I’m not sure exactly what happened, but today my Hoarder can fetch web pages normally. The web container is also outputting logs correctly. It seems that the issue has indeed been resolved in the latest version.

Thank you so much for your prompt and swift response!

<!-- gh-comment-id:2525956634 --> @Deus-Glen commented on GitHub (Dec 8, 2024): I’m not sure exactly what happened, but today my Hoarder can fetch web pages normally. The web container is also outputting logs correctly. It seems that the issue has indeed been resolved in the latest version. Thank you so much for your prompt and swift response!
Author
Owner

@MohamedBassem commented on GitHub (Dec 9, 2024):

@Deus-Glen The worker container on startup downloads ads blocklists from github. If you're in a country where github is blocked (or with internet restriction in general), the worker will take some time until it timesout and skips downloading the adblocker list (that was the fix in the last release). It'll however keep happening every time you restart the worker. So you might want to set CRAWLER_ENABLE_ADBLOCKER=false in your env file.

<!-- gh-comment-id:2526556192 --> @MohamedBassem commented on GitHub (Dec 9, 2024): @Deus-Glen The worker container on startup downloads ads blocklists from github. If you're in a country where github is blocked (or with internet restriction in general), the worker will take some time until it timesout and skips downloading the adblocker list (that was the fix in the last release). It'll however keep happening every time you restart the worker. So you might want to set `CRAWLER_ENABLE_ADBLOCKER=false` in your env file.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/karakeep#467
No description provided.