[GH-ISSUE #663] Issues Crawling Youtube Shorts link because of cookies consent banner. #425

Closed
opened 2026-03-02 11:49:43 +03:00 by kerem · 1 comment

Originally created by @gercollo on GitHub (Nov 16, 2024).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/663

Describe the Bug

The crawler is unable to properly scrape YouTube Shorts links due to the cookie consent banner that appears for EU/EEA users. This banner blocks access to the content and prevents successful crawling of Shorts videos.

Steps to Reproduce

  • Cookie consent popup appears when accessing YouTube Shorts links
  • Crawler cannot bypass or handle the consent banner
  • Content behind the banner is inaccessible
  • Crawling returns inaccurate metadata
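The symptoms listed above suggest the crawler ends up parsing the consent page rather than the Shorts page. A minimal sketch of how a crawl result could be flagged as a consent interstitial before its metadata is trusted is shown here. The host names, the title hint, and both function names are assumptions made for illustration; they are not taken from the Hoarder/Karakeep code base, and the title check only covers the English wording.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Assumption: hosts that serve the Google/YouTube consent interstitial.
CONSENT_HOSTS = {"consent.youtube.com", "consent.google.com"}
# Assumption: lowercase fragment of the consent page title (English locale only).
CONSENT_TITLE_HINTS = ("before you continue",)


def looks_like_consent_page(final_url: str, page_title: str) -> bool:
    """Heuristically decide whether a crawl landed on a consent page."""
    host = (urlparse(final_url).hostname or "").lower()
    if host in CONSENT_HOSTS:
        return True
    title = page_title.strip().lower()
    return any(hint in title for hint in CONSENT_TITLE_HINTS)


def original_target(final_url: str) -> Optional[str]:
    """Return the URL carried in the `continue` query parameter, if any."""
    values = parse_qs(urlparse(final_url).query).get("continue")
    return values[0] if values else None
```

A crawler could call `looks_like_consent_page` on the final URL and title after navigation and, when it returns `True`, skip tag generation or retry instead of storing the banner's metadata. This only detects the problem; it does not dismiss the banner.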

Expected Behaviour

Tags should be generated based on the YouTube video title.

Screenshots or Additional Context

[Screenshot: https://github.com/user-attachments/assets/1c52b98d-b857-4cf9-928c-17953f813300]

Device Details

No response

Exact Hoarder Version

v19

kerem 2026-03-02 11:49:43 +03:00
  • closed this issue
  • added the bug label

@MohamedBassem commented on GitHub (Nov 17, 2024):

Yeah, this is getting annoying. I'm planning to better tackle them hopefully sometime soon. Let's track this in #414 instead
