[GH-ISSUE #362] Crawler Failed #235

Closed
opened 2026-03-02 11:47:52 +03:00 by kerem · 8 comments
Owner

Originally created by @techdixie on GitHub (Aug 23, 2024).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/362

hoarder_workers | 2024-08-23T19:24:44.650Z error: [Crawler] Failed to connect to the browser instance, will retry in 5 secs hoarder_workers | 2024-08-23T19:24:49.651Z info: [Crawler] Connecting to existing browser instance: http://chrome:9222

I was getting some errors so I upgraded to the latest release. Now I am getting a Crawler error. I haven't touched chrome. I am not using kubernetes.

services: web: image: ghcr.io/hoarder-app/hoarder-web:latest container_name: hoarder_web restart: unless-stopped volumes: - data:/data ports: - 3000:3000 env_file: - .env environment: REDIS_HOST: redis MEILI_ADDR: http://meilisearch:7700 DATA_DIR: /data HOARDER_VERSION: release NEXTAUTH_SECRET: generated_key MEILI_MASTER_KEY: second_generated_key NEXTAUTH_URL: http://ip:3000 redis: image: redis:7.2-alpine container_name: hoarder_redis restart: unless-stopped volumes: - redis:/data chrome: image: gcr.io/zenika-hub/alpine-chrome:123 container_name: hoarder_chrome restart: unless-stopped command: - --no-sandbox - --disable-gpu - --disable-dev-shm-usage - --remote-debugging-address=0.0.0.0 - --remote-debugging-port=9222 - --hide-scrollbars meilisearch: image: getmeili/meilisearch:v1.6 container_name: hoarder_meilisearch restart: unless-stopped env_file: - .env environment: MEILI_NO_ANALYTICS: "true" volumes: - meilisearch:/meili_data workers: image: ghcr.io/hoarder-app/hoarder-workers:latest container_name: hoarder_workers restart: unless-stopped volumes: - data:/data env_file: - .env environment: REDIS_HOST: redis MEILI_ADDR: http://meilisearch:7700 BROWSER_WEB_URL: http://chrome:9222 DATA_DIR: /data OPENAI_API_KEY: open_ai_key INFERENCE_TEXT_MODEL: gpt-4o-mini INFERENCE_IMAGE_MODEL: gpt-4o-mini depends_on: web: condition: service_started volumes: redis: null meilisearch: null data: null networks: synobridge: external: true

Thanks! (edit: no clue how to keep formatting :/)

Originally created by @techdixie on GitHub (Aug 23, 2024). Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/362 `hoarder_workers | 2024-08-23T19:24:44.650Z error: [Crawler] Failed to connect to the browser instance, will retry in 5 secs hoarder_workers | 2024-08-23T19:24:49.651Z info: [Crawler] Connecting to existing browser instance: http://chrome:9222` I was getting some errors so I upgraded to the latest release. Now I am getting a Crawler error. I haven't touched chrome. I am not using kubernetes. `services: web: image: ghcr.io/hoarder-app/hoarder-web:latest container_name: hoarder_web restart: unless-stopped volumes: - data:/data ports: - 3000:3000 env_file: - .env environment: REDIS_HOST: redis MEILI_ADDR: http://meilisearch:7700 DATA_DIR: /data HOARDER_VERSION: release NEXTAUTH_SECRET: generated_key MEILI_MASTER_KEY: second_generated_key NEXTAUTH_URL: http://ip:3000 redis: image: redis:7.2-alpine container_name: hoarder_redis restart: unless-stopped volumes: - redis:/data chrome: image: gcr.io/zenika-hub/alpine-chrome:123 container_name: hoarder_chrome restart: unless-stopped command: - --no-sandbox - --disable-gpu - --disable-dev-shm-usage - --remote-debugging-address=0.0.0.0 - --remote-debugging-port=9222 - --hide-scrollbars meilisearch: image: getmeili/meilisearch:v1.6 container_name: hoarder_meilisearch restart: unless-stopped env_file: - .env environment: MEILI_NO_ANALYTICS: "true" volumes: - meilisearch:/meili_data workers: image: ghcr.io/hoarder-app/hoarder-workers:latest container_name: hoarder_workers restart: unless-stopped volumes: - data:/data env_file: - .env environment: REDIS_HOST: redis MEILI_ADDR: http://meilisearch:7700 BROWSER_WEB_URL: http://chrome:9222 DATA_DIR: /data OPENAI_API_KEY: open_ai_key INFERENCE_TEXT_MODEL: gpt-4o-mini INFERENCE_IMAGE_MODEL: gpt-4o-mini depends_on: web: condition: service_started volumes: redis: null meilisearch: null data: null networks: synobridge: external: true ` Thanks! (edit: no clue how to keep formatting :/)
kerem 2026-03-02 11:47:52 +03:00
  • closed this issue
  • added the
    question
    label
Author
Owner

@MohamedBassem commented on GitHub (Aug 24, 2024):

Hey, I see that you changed the container name of chrome to hoarder_chrome, in that case, you'll need to change the BROWSER_WEB_URL to http://hoarder_chrome:9222

<!-- gh-comment-id:2308432243 --> @MohamedBassem commented on GitHub (Aug 24, 2024): Hey, I see that you changed the container name of chrome to `hoarder_chrome`, in that case, you'll need to change the `BROWSER_WEB_URL` to `http://hoarder_chrome:9222`
Author
Owner

@techdixie commented on GitHub (Aug 24, 2024):

I'm wondering if my container rename is messing more than that up. I'll remove those. I'm not getting tagging, homepage now says "Something went wrong". The app says its failing at crawling.

<!-- gh-comment-id:2308460370 --> @techdixie commented on GitHub (Aug 24, 2024): I'm wondering if my container rename is messing more than that up. I'll remove those. I'm not getting tagging, homepage now says "Something went wrong". The app says its failing at crawling.
Author
Owner

@techdixie commented on GitHub (Aug 24, 2024):

ok, this corrected ai tagging and the crawlers. Only thing that looks like its not working is the homepage that says "Something went Wrong"

<!-- gh-comment-id:2308476440 --> @techdixie commented on GitHub (Aug 24, 2024): ok, this corrected ai tagging and the crawlers. Only thing that looks like its not working is the homepage that says "Something went Wrong"
Author
Owner

@MohamedBassem commented on GitHub (Aug 24, 2024):

Can you share the logs of the web container?

<!-- gh-comment-id:2308540140 --> @MohamedBassem commented on GitHub (Aug 24, 2024): Can you share the logs of the web container?
Author
Owner

@techdixie commented on GitHub (Aug 24, 2024):

hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async p (/app/apps/web/.next/server/chunks/624.js:4:33306) hoarder-web-1 | at async p (/app/apps/web/.next/server/app/dashboard/bookmarks/page.js:1:10468) { hoarder-web-1 | issues: [ [Object] ], hoarder-web-1 | addIssue: [Function (anonymous)], hoarder-web-1 | addIssues: [Function (anonymous)], hoarder-web-1 | errors: [ [Object] ] hoarder-web-1 | } hoarder-web-1 | } hoarder-web-1 | [Error: An error occurred in the Server Components render. The specific message is omitted in production builds to avoid leaking sensitive details. A digest property is included on this error instance which may provide additional details about the nature of the error.] { hoarder-web-1 | digest: '2631817619' hoarder-web-1 | } hoarder-web-1 | m [TRPCError]: Output validation failed hoarder-web-1 | at p (/app/apps/web/.next/server/chunks/624.js:4:32429) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | ... 4 lines matching cause stack trace ... hoarder-web-1 | at async p (/app/apps/web/.next/server/app/dashboard/bookmarks/page.js:1:10468) { hoarder-web-1 | code: 'INTERNAL_SERVER_ERROR', hoarder-web-1 | [cause]: o [ZodError]: [ hoarder-web-1 | { hoarder-web-1 | "code": "invalid_type", hoarder-web-1 | "expected": "string", hoarder-web-1 | "received": "undefined", hoarder-web-1 | "path": [ hoarder-web-1 | "bookmarks", hoarder-web-1 | 2, hoarder-web-1 | "content", hoarder-web-1 | "url" hoarder-web-1 | ], hoarder-web-1 | "message": "Required" hoarder-web-1 | } hoarder-web-1 | ] hoarder-web-1 | at get error [as error] (/app/apps/web/.next/server/chunks/917.js:63:14269) hoarder-web-1 | at Q.parseAsync (/app/apps/web/.next/server/chunks/917.js:63:16752) hoarder-web-1 | at async p (/app/apps/web/.next/server/chunks/624.js:4:32378) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async p (/app/apps/web/.next/server/chunks/624.js:4:33306) hoarder-web-1 | at async p (/app/apps/web/.next/server/app/dashboard/bookmarks/page.js:1:10468) { hoarder-web-1 | issues: [ [Object] ], hoarder-web-1 | addIssue: [Function (anonymous)], hoarder-web-1 | addIssues: [Function (anonymous)], hoarder-web-1 | errors: [ [Object] ] hoarder-web-1 | } hoarder-web-1 | }

<!-- gh-comment-id:2308552931 --> @techdixie commented on GitHub (Aug 24, 2024): `hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async p (/app/apps/web/.next/server/chunks/624.js:4:33306) hoarder-web-1 | at async p (/app/apps/web/.next/server/app/dashboard/bookmarks/page.js:1:10468) { hoarder-web-1 | issues: [ [Object] ], hoarder-web-1 | addIssue: [Function (anonymous)], hoarder-web-1 | addIssues: [Function (anonymous)], hoarder-web-1 | errors: [ [Object] ] hoarder-web-1 | } hoarder-web-1 | } hoarder-web-1 | [Error: An error occurred in the Server Components render. The specific message is omitted in production builds to avoid leaking sensitive details. A digest property is included on this error instance which may provide additional details about the nature of the error.] { hoarder-web-1 | digest: '2631817619' hoarder-web-1 | } hoarder-web-1 | m [TRPCError]: Output validation failed hoarder-web-1 | at p (/app/apps/web/.next/server/chunks/624.js:4:32429) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | ... 4 lines matching cause stack trace ... hoarder-web-1 | at async p (/app/apps/web/.next/server/app/dashboard/bookmarks/page.js:1:10468) { hoarder-web-1 | code: 'INTERNAL_SERVER_ERROR', hoarder-web-1 | [cause]: o [ZodError]: [ hoarder-web-1 | { hoarder-web-1 | "code": "invalid_type", hoarder-web-1 | "expected": "string", hoarder-web-1 | "received": "undefined", hoarder-web-1 | "path": [ hoarder-web-1 | "bookmarks", hoarder-web-1 | 2, hoarder-web-1 | "content", hoarder-web-1 | "url" hoarder-web-1 | ], hoarder-web-1 | "message": "Required" hoarder-web-1 | } hoarder-web-1 | ] hoarder-web-1 | at get error [as error] (/app/apps/web/.next/server/chunks/917.js:63:14269) hoarder-web-1 | at Q.parseAsync (/app/apps/web/.next/server/chunks/917.js:63:16752) hoarder-web-1 | at async p (/app/apps/web/.next/server/chunks/624.js:4:32378) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async d (/app/apps/web/.next/server/chunks/624.js:4:32967) hoarder-web-1 | at async p (/app/apps/web/.next/server/chunks/624.js:4:33306) hoarder-web-1 | at async p (/app/apps/web/.next/server/app/dashboard/bookmarks/page.js:1:10468) { hoarder-web-1 | issues: [ [Object] ], hoarder-web-1 | addIssue: [Function (anonymous)], hoarder-web-1 | addIssues: [Function (anonymous)], hoarder-web-1 | errors: [ [Object] ] hoarder-web-1 | } hoarder-web-1 | }`
Author
Owner

@MohamedBassem commented on GitHub (Sep 22, 2024):

@techdixie did you figure this out?

<!-- gh-comment-id:2366831127 --> @MohamedBassem commented on GitHub (Sep 22, 2024): @techdixie did you figure this out?
Author
Owner

@techdixie commented on GitHub (Sep 23, 2024):

Mine works perfectly now, but I did nothing to fix it unless there was an update at some point. Sorry I can't be of more assistance.

<!-- gh-comment-id:2368644683 --> @techdixie commented on GitHub (Sep 23, 2024): Mine works perfectly now, but I did nothing to fix it unless there was an update at some point. Sorry I can't be of more assistance.
Author
Owner

@kamtschatka commented on GitHub (Oct 13, 2024):

closing as it works now

<!-- gh-comment-id:2409030310 --> @kamtschatka commented on GitHub (Oct 13, 2024): closing as it works now
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/karakeep#235
No description provided.