[PR #95] [MERGED] feature: Recrawl failed links #1526

Closed
opened 2026-03-02 11:58:02 +03:00 by kerem · 0 comments

📋 Pull Request Information

Original PR: https://github.com/karakeep-app/karakeep/pull/95
Author: @ahmadmucom
Created: 4/11/2024
Status: Merged
Merged: 4/11/2024
Merged by: @MohamedBassem

Base: main ← Head: main


📝 Commits (5)

  • eac47f6 feature: Add PDF support
  • 4904d18 fix: PDF feature enhancements
  • 6f1022d fix: enhancing PDF feature
  • 0cf30f4 feature: Retry failed crawling URLs
  • 0319a42 fix: Enhancing visuals and some minor changes.

📊 Changes

8 files changed (+1067 additions, -25 deletions)

📝 apps/web/app/dashboard/admin/page.tsx (+21 -5)
📝 apps/workers/crawlerWorker.ts (+20 -0)
➕ packages/db/drizzle/0017_slippery_senator_kelly.sql (+3 -0)
➕ packages/db/drizzle/meta/0017_snapshot.json (+967 -0)
📝 packages/db/drizzle/meta/_journal.json (+7 -0)
📝 packages/db/schema.ts (+3 -0)
📝 packages/trpc/routers/admin.ts (+45 -19)
📝 pnpm-lock.yaml (+1 -1)

📄 Description

  • Added a backoff to the link crawler queue
  • Added a new field (crawlStatus) to the bookmarkLinks table
  • Added a new button to the web app to recrawl failed URLs, selected by their crawlStatus
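
The three changes above fit together roughly as follows. This is a minimal sketch, not the code from the PR: the status values, the `BookmarkLink` shape, the helper names, and the delay parameters are all assumptions made for illustration, and the status field is spelled `crawlStatus` here.

```typescript
// Hypothetical status values; the PR description names the field only.
type CrawlStatus = "pending" | "success" | "failure";

// Hypothetical, trimmed-down view of a row in the bookmarkLinks table.
interface BookmarkLink {
  id: string;
  url: string;
  crawlStatus: CrawlStatus | null;
}

// Exponential backoff: the delay doubles with each attempt and is capped at maxMs.
// baseMs and maxMs are illustrative defaults, not values taken from the PR.
function backoffDelayMs(attempt: number, baseMs = 1000, maxMs = 60000): number {
  const safeAttempt = Math.max(1, Math.floor(attempt));
  return Math.min(maxMs, baseMs * 2 ** (safeAttempt - 1));
}

// Pick the links a "recrawl failed links" action would re-enqueue.
function selectFailedLinks(links: BookmarkLink[]): BookmarkLink[] {
  return links.filter((link) => link.crawlStatus === "failure");
}

// Return copies of the given links marked as pending, ready to be re-enqueued.
function markPending(links: BookmarkLink[]): BookmarkLink[] {
  return links.map((link) => ({ ...link, crawlStatus: "pending" as const }));
}

const sampleLinks: BookmarkLink[] = [
  { id: "1", url: "https://example.com/a", crawlStatus: "success" },
  { id: "2", url: "https://example.com/b", crawlStatus: "failure" },
  { id: "3", url: "https://example.com/c", crawlStatus: null },
];

const toRecrawl = markPending(selectFailedLinks(sampleLinks));
console.log(toRecrawl.map((link) => link.id));
```

In this sketch, rows with a null status are left alone, so only links explicitly marked as failed are retried; whether the real migration treats older rows that way is not stated in the description.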

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

Reference
starred/karakeep#1526