[GH-ISSUE #2334] (Reddit) RSS feed not respecting provided metadata (date/author) #1416

Open
opened 2026-03-02 11:57:09 +03:00 by kerem · 0 comments
Owner

Originally created by @leonbeon on GitHub (Jan 2, 2026).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/2334

Describe the Bug

I am importing a reddit saved feed using the rss feed generator of saved links that reddit provides, which generates a URL that looks like this: https://old.reddit.com/saved.rss?feed=...&user=.... There are multiple issues with this when importing Karakeep (e.g. capturing the side bar content on old reddit for read mode, even when providing login cookies so the side capture works), but I want to focus on what I think is simply a bug in RSS feed parsing: Metadata is not respected when importing links.

This features works in Firefox bookmark import for example, but the RSS import doesn't seem to respect the saved date. Here is a snippet of a reddit rss feed:

<feed>
	<entry>
		<author>
			<name>/u/username</name>
			<uri>https://old.reddit.com/user/username</uri>
		</author>
		<category term="sub" label="r/sub"/>
		<content type="html">
			<!-- SC_OFF -->
			<div class="md">
				<p>...</p>
			</div>
			<!-- SC_ON -->
		</content>
		<id>post_id</id>
		<link href="https://example.org/permalink"/>
		<updated>2025-12-31T12:28:39+00:00</updated>
		<title>
Post title
</title>
	</entry>
</feed>

When you save a post and not a comment like above, you get another property "published", but this seems to be the same as "updated".
This is a comment published on 2025-12-31, but importing it today shows todays date 2025-01-02 as the creation date.

The author of the post is also marked as my own account, and not the actual publisher, even though this data is available in the <author> tag.

Steps to Reproduce

  1. Generate a reddit saved post feed by visiting https://old.reddit.com/prefs/feeds/ and saving the link under private listings > rss.
  2. Import this link in KaraKeeps RSS feature
  3. Look at the generated metadata for the saved links

Expected Behaviour

I expected the date and author fields to be filled correctly, especially because the URL and title fields are extracted properly. A nice touch would also be to include the subreddit somehow (provided by the category tag), either as tag or in the publisher field instead of the generic "reddit".

Screenshots or Additional Context

No response

Device Details

No response

Exact Karakeep Version

0.30.0

Have you checked the troubleshooting guide?

  • I have checked the troubleshooting guide and I haven't found a solution to my problem
Originally created by @leonbeon on GitHub (Jan 2, 2026). Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/2334 ### Describe the Bug I am importing a reddit saved feed using the [rss feed generator of saved links](https://old.reddit.com/prefs/feeds/) that reddit provides, which generates a URL that looks like this: `https://old.reddit.com/saved.rss?feed=...&user=...`. There are multiple issues with this when importing Karakeep (e.g. capturing the side bar content on old reddit for read mode, even when providing login cookies so the side capture works), but I want to focus on what I think is simply a bug in RSS feed parsing: Metadata is not respected when importing links. This features works in Firefox bookmark import for example, but the RSS import doesn't seem to respect the saved date. Here is a snippet of a reddit rss feed: ```xml <feed> <entry> <author> <name>/u/username</name> <uri>https://old.reddit.com/user/username</uri> </author> <category term="sub" label="r/sub"/> <content type="html"> <!-- SC_OFF --> <div class="md"> <p>...</p> </div> <!-- SC_ON --> </content> <id>post_id</id> <link href="https://example.org/permalink"/> <updated>2025-12-31T12:28:39+00:00</updated> <title> Post title </title> </entry> </feed> ``` When you save a post and not a comment like above, you get another property "published", but this seems to be the same as "updated". This is a comment published on 2025-12-31, but importing it today shows todays date 2025-01-02 as the creation date. The author of the post is also marked as my own account, and not the actual publisher, even though this data is available in the `<author>` tag. ### Steps to Reproduce 1. Generate a reddit saved post feed by visiting https://old.reddit.com/prefs/feeds/ and saving the link under private listings > rss. 2. Import this link in KaraKeeps RSS feature 3. Look at the generated metadata for the saved links ### Expected Behaviour I expected the date and author fields to be filled correctly, especially because the URL and title fields are extracted properly. A nice touch would also be to include the subreddit somehow (provided by the category tag), either as tag or in the publisher field instead of the generic "reddit". ### Screenshots or Additional Context _No response_ ### Device Details _No response_ ### Exact Karakeep Version 0.30.0 ### Have you checked the troubleshooting guide? - [x] I have checked the troubleshooting guide and I haven't found a solution to my problem
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/karakeep#1416
No description provided.