[GH-ISSUE #687] Import from Wallabag #446

Open
opened 2026-03-02 11:49:56 +03:00 by kerem · 5 comments
Owner

Originally created by @eshutoff on GitHub (Nov 22, 2024).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/687

Describe the feature you'd like

Hi Mohamed,
could you add the import from Wallabag formats (xml, csv, json)? I attached examples of the exported data from Wallabag version 2.6.9.
wallabag_export.zip

Describe the benefits this would bring to existing Hoarder users

Grab some Wallabag users

Can the goal of this request already be achieved via other means?

didn't find any

Have you searched for an existing open/closed issue?

  • I have searched for existing issues and none cover my fundamental request

Additional context

No response

Originally created by @eshutoff on GitHub (Nov 22, 2024). Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/687 ### Describe the feature you'd like Hi Mohamed, could you add the import from Wallabag formats (xml, csv, json)? I attached examples of the exported data from Wallabag version 2.6.9. [wallabag_export.zip](https://github.com/user-attachments/files/17876655/wallabag_export.zip) ### Describe the benefits this would bring to existing Hoarder users Grab some Wallabag users ### Can the goal of this request already be achieved via other means? didn't find any ### Have you searched for an existing open/closed issue? - [X] I have searched for existing issues and none cover my fundamental request ### Additional context _No response_
Author
Owner

@andrewlow commented on GitHub (Dec 10, 2024):

https://github.com/hoarder-app/hoarder/discussions/581 - discussion on someone doing this.. unclear if it was successful

<!-- gh-comment-id:2531791501 --> @andrewlow commented on GitHub (Dec 10, 2024): https://github.com/hoarder-app/hoarder/discussions/581 - discussion on someone doing this.. unclear if it was successful
Author
Owner

@1xPdd commented on GitHub (Jan 1, 2025):

unclear if it was successful

It looks like they imported a list of saved URLs. They say they lost tags. Presumably, they lost any page where the URL had changed or rotted away. For me, many of the items in my Wallabag library no longer exist on the Internet, so this sort of import would mean a huge loss.

<!-- gh-comment-id:2567049229 --> @1xPdd commented on GitHub (Jan 1, 2025): > unclear if it was successful It looks like they imported a list of saved URLs. They say they lost tags. Presumably, they lost any page where the URL had changed or rotted away. For me, many of the items in my Wallabag library no longer exist on the Internet, so this sort of import would mean a huge loss.
Author
Owner

@thiswillbeyourgithub commented on GitHub (Jan 1, 2025):

It looks like they imported a list of saved URLs. They say they lost tags. Presumably, they lost any page where the URL had changed or rotted away. For me, many of the items in my Wallabag library no longer exist on the Internet, so this sort of import would mean a huge loss.

this could be solved by a small script that fetches url, and if encounters a 404 or other errors tries to use waybackmachine for example. No?

<!-- gh-comment-id:2567050766 --> @thiswillbeyourgithub commented on GitHub (Jan 1, 2025): > It looks like they imported a list of saved URLs. They say they lost tags. Presumably, they lost any page where the URL had changed or rotted away. For me, many of the items in my Wallabag library no longer exist on the Internet, so this sort of import would mean a huge loss. this could be solved by a small script that fetches url, and if encounters a 404 or other errors tries to use waybackmachine for example. No?
Author
Owner

@1xPdd commented on GitHub (Feb 9, 2025):

Picking up stuff from waybackmachine or other archives would certainly be better than nothing and might make a nice user-facing option, but I personally have loads of things in Wallabag that never ended up there. I have no idea how unique my situation is, though.

<!-- gh-comment-id:2646558057 --> @1xPdd commented on GitHub (Feb 9, 2025): Picking up stuff from waybackmachine or other archives would certainly be better than nothing and might make a nice user-facing option, but I personally have loads of things in Wallabag that never ended up there. I have no idea how unique my situation is, though.
Author
Owner

@meonkeys commented on GitHub (Apr 28, 2025):

Not sure if it's clear from this discussion, but check out the wallabag export data: they include full cached/saved article content. Also, the wallabag server may also have all the images, so if that's on you have a very durable saved copy. I'd love it if karakeep also got the images!

<!-- gh-comment-id:2835874274 --> @meonkeys commented on GitHub (Apr 28, 2025): Not sure if it's clear from this discussion, but check out the wallabag export data: they include full cached/saved article content. Also, [the wallabag server may also have all the images](https://doc.wallabag.org/admin/internal_settings/#download-images-locally), so if that's on you have a very durable saved copy. I'd love it if karakeep also got the images!
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/karakeep#446
No description provided.