[GH-ISSUE #63] Waiting for service worker #58

Closed
opened 2026-03-02 11:46:09 +03:00 by kerem · 3 comments
Owner

Originally created by @bibinisacmohan on GitHub (Apr 1, 2024).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/63

The web page opens fine. However, the Chrome extension shows the following error:

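![Screenshot 2024-04-01 at 3 01 43 PM](https://github.com/MohamedBassem/hoarder-app/assets/125105686/aeb606e6-5637-4a43-b618-2fea1d9c66cd)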

Also, I noticed that site crawling has stopped. No preview of the webpage is available.

The following is my Docker Compose file:

version: "3.8"
services:
  web:
    image: ghcr.io/mohamedbassem/hoarder-web:${HOARDER_VERSION:-release}
    restart: unless-stopped
    volumes:
      - /volume1/docker/hoarder/data:/data
    ports:
      - 3003:3000
    env_file:
      - .env
    environment:
      REDIS_HOST: redis
      MEILI_ADDR: http://meilisearch:7700
      DATA_DIR: /data
  redis:
    image: redis:7.2-alpine
    restart: unless-stopped
    volumes:
      - /volume1/docker/hoarder/redis:/data
  chrome:
    image: gcr.io/zenika-hub/alpine-chrome:100
    restart: unless-stopped
    command:
      - --no-sandbox
      - --disable-gpu
      - --remote-debugging-address=0.0.0.0
      - --remote-debugging-port=9222
  meilisearch:
    image: getmeili/meilisearch:v1.6
    restart: unless-stopped
    env_file:
      - .env
    volumes:
      - /volume1/docker/hoarder/meilisearch:/meili_data
  workers:
    image: ghcr.io/mohamedbassem/hoarder-workers:${HOARDER_VERSION:-release}
    restart: unless-stopped
    volumes:
      - /volume1/docker/hoarder/data:/data
    env_file:
      - .env
    environment:
      REDIS_HOST: redis
      MEILI_ADDR: http://meilisearch:7700
      BROWSER_WEB_URL: http://chrome:9222
      DATA_DIR: /data
      # OPENAI_API_KEY: ...
    depends_on:
      web:
        condition: service_started

kerem closed this issue and added the bug label on 2026-03-02 11:46:09 +03:00

@MohamedBassem commented on GitHub (Apr 1, 2024):

Regarding the extension: I uploaded a development build by mistake. I have now submitted a hotfix to the Chrome Web Store and am waiting for Google to approve it. Sorry about that.

Regarding the crawler: I've seen one instance where the crawler gets stuck on certain webpages. I have one such link, and I'm working on a fix. If you check the last crawled link in your worker logs, it's usually that one. If you remove it from your bookmarks and restart the workers, crawling should continue. If you're OK with sharing that link with me for debugging purposes, I'd be thankful. If not, that's fine too.

Sorry for the inconvenience.
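
The workaround described above (find the last crawled link in the worker logs, remove that bookmark, restart the workers) can be sketched roughly as follows. This is a minimal helper and not part of the project: the function name `last_url_in_logs` is hypothetical, and it assumes only that the worker log lines contain the crawled URL somewhere in their text, since the exact log format is not shown in this thread.

```python
import re
from typing import Optional

# Matches http(s) URLs up to the next whitespace, quote, or angle bracket.
# Assumption: the crawled link appears verbatim in the worker log text.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+")


def last_url_in_logs(log_text: str) -> Optional[str]:
    """Return the last URL mentioned in the log text, or None if none is found."""
    matches = URL_PATTERN.findall(log_text)
    if not matches:
        return None
    # Drop trailing punctuation that may follow a URL inside a log sentence.
    return matches[-1].rstrip(".,;")
```

One possible way to use it, assuming the service is named `workers` as in the compose file above: save the output of `docker compose logs --no-color --tail=200 workers` to a file, pass the file contents to `last_url_in_logs`, delete the bookmark for the returned link, and then run `docker compose restart workers`.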


@MohamedBassem commented on GitHub (Apr 2, 2024):

The new version, 1.1.2, is now published to the Chrome Web Store. Chrome should auto-update it for you.


@bibinisacmohan commented on GitHub (Apr 2, 2024):

Thanks for the quick fix. The following is the link that was giving trouble:
https://www.tripadvisor.com/Attraction_Review-g60784-d261325-Reviews-Cuyahoga_Valley_National_Park-Brecksville_Ohio.html

I removed and resubmitted the crawl, and everything is working fine now.
