[GH-ISSUE #2376] [Question] How to stop failed HTML importing? #1435

Open
opened 2026-03-02 11:57:20 +03:00 by kerem · 3 comments
Owner

Originally created by @lilskippyy on GitHub (Jan 11, 2026).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/2376

Hello. I uploaded an HTML bookmarks file with 2000 bookmarks, but it has been stuck at 1999 pending / 1 complete for two days and doesn't seem to be making any progress. How can I stop it so I can start again?

Also, I have noticed that it handles importing up to 1000 bookmarks from a single HTML file okay, but not more than that. Is that the expected behavior?

Thanks!


@MohamedBassem commented on GitHub (Jan 11, 2026):

Hey, are you using karakeep cloud or the self hosted instance?

If you're using the self hosted instance, can you check if you have pending "crawling" or "inference" in the admin console? Also, what kind of problems are you facing with uploading more than 1000 bookmarks?

In general, I'm overhauling the whole import process to make it handle bigger imports more gracefully. It's expected in the next release.


@lilskippyy commented on GitHub (Jan 11, 2026):

> Hey, are you using karakeep cloud or the self hosted instance?

Self-hosted.

> If you're using the self hosted instance, can you check if you have pending "crawling" or "inference" in the admin console? Also, what kind of problems are you facing with uploading more than 1000 bookmarks?

It finally started importing. It's at 42% now, but only after I shut down most of the other stuff that was running on the same machine. Not sure if that helped.

What happens when I upload more than 1000 is exactly this: it gets stuck and doesn't progress at all, or it processes just a few bookmarks and then stops.

> In general, I'm overhauling the whole import process to make it handle bigger imports more gracefully. It's expected in the next release.

That's great!!


@scottcawley commented on GitHub (Jan 12, 2026):

Just to jump on the back of this thread - I've had some trouble this weekend trying to import stuff from an old Pocket export which showed similar behaviour.

I'm self hosted (and very new to SH, so I am amazed I have got this far) and the import is still 'processing' with 1 pending article, even though it's been going for 19 hours and says it's at 100%. My memory and swap were maxed out, but I restarted the container and things settled down. However, this one article is still 'pending'.

Separately, this feedback might not be useful, but just to say I have found the import process quite confusing, so it's good to hear that it might get an overhaul. My import says "1 pending / 126 completed / 948 failed", even though it looks like most (nearly all?) of the articles have imported and have not failed.

However, separately there are about 200+ broken links (which I would've thought would be considered 'failed' items), so I am not sure what the 948 figure is referring to, nor why the 200+ links aren't the failed ones. Nor why it says only 126 have completed when clearly a *lot* more than that have. Then on top of that, the crawler jobs and inference jobs have unprocessed and failed figures which don't total up to the failed import numbers either. Very confusing!

I am just trying to ignore all of this as using Karakeep in every other way is great, but it is scratching away at the back of my brain that something is not set up quite right!

<img width="2712" height="632" alt="Image" src="https://github.com/user-attachments/assets/e10750b8-48c8-48c1-a10c-d5a90a4a2ef0" />

<img width="2776" height="810" alt="Image" src="https://github.com/user-attachments/assets/831042ef-b4ad-4b7e-a7b0-51c8f7565a46" />