[GH-ISSUE #84] archive from a session buddy export fails #3082

Closed
opened 2026-03-14 20:56:17 +03:00 by kerem · 3 comments

Originally created by @thewriteway on GitHub (Jul 29, 2018).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/84

It doesn't seem to support Session Buddy JSON or HTML exports.

Session Buddy is a Chrome extension: https://chrome.google.com/webstore/detail/session-buddy/edacconmaakjimmfgnblocblbcdcpbko

Running the archive tool on an export produces an error message saying that no links could be found.

Please see this sample extract from a Session Buddy HTML export:

session_buddy_export_2018_07_29_08_12_20.txt (https://github.com/pirate/bookmark-archiver/files/2238616/session_buddy_export_2018_07_29_08_12_20.txt)

@pirate commented on GitHub (Jul 31, 2018):

For now you can run it through this bash one-liner and then try importing it:

# Rewrite each line as a bare <li><a href="URL" time_added="..." tags="">URL</a></li> entry, dropping the surrounding markup
perl -pe 's/(.+?)(<a href=")?(http.+?)?(" target="_blank">.+<\/a><\/li>)?$/<li><a href="$3" time_added="1400000000" tags="">$3<\/a><\/li>/gm' < session_buddy_export.html > fixed_export.html
./archive fixed_export.html
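
For readers who would rather not decode the regex, here is a rough Python sketch of the same idea: pull each link URL out of the export and emit it in the `<li><a href=... time_added=... tags="">` form used by the one-liner above. The function name `convert_export` is hypothetical, and the sketch assumes double-quoted `href` attributes, as the one-liner's pattern does. Unlike the Perl version, it skips lines without a link instead of emitting an entry with an empty `href`.

```python
import re

# Captures the href value of each anchor tag.
# Assumption: attributes are double-quoted and the URL starts with http(s).
HREF_RE = re.compile(r'<a\s+[^>]*?href="(https?://[^"]+)"', re.IGNORECASE)


def convert_export(export_html: str, time_added: int = 1400000000) -> str:
    """Return one <li><a ...> line per link found in export_html."""
    lines = []
    for url in HREF_RE.findall(export_html):
        # The href is reused verbatim (including any &amp; escapes),
        # both as the link target and as the link text.
        lines.append(
            f'<li><a href="{url}" time_added="{time_added}" tags="">{url}</a></li>'
        )
    return "\n".join(lines)
```

The result can be written to a file such as `fixed_export.html` and passed to `./archive` in the same way as the output of the one-liner.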

Can you provide a sample of the JSON so I can try extending the parser for it?
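
Until a real JSON sample is available, one possible stopgap is to ignore the schema entirely and collect every string in the file that looks like an http(s) URL. The sketch below does that; it is not the parser that was later added to the project, the helper names `collect_urls` and `urls_from_json` are hypothetical, and nothing here assumes how Session Buddy actually structures its JSON. A known limitation is that it also picks up non-page URLs (for example icon links) if the export contains any.

```python
import json


def collect_urls(node):
    """Recursively yield every http(s) string found anywhere in parsed JSON."""
    if isinstance(node, dict):
        for value in node.values():
            yield from collect_urls(value)
    elif isinstance(node, list):
        for item in node:
            yield from collect_urls(item)
    elif isinstance(node, str) and node.startswith(("http://", "https://")):
        yield node


def urls_from_json(text):
    """Parse JSON text and return its unique URLs in first-seen order."""
    # dict.fromkeys de-duplicates while preserving insertion order.
    return list(dict.fromkeys(collect_urls(json.loads(text))))
```

The resulting list can be written out one URL per line, or in the `<li><a href=...>` form shown in the one-liner, before being passed to `./archive`.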


@thewriteway commented on GitHub (Aug 5, 2018):

Sure, will try this out as well. Thanks


@pirate commented on GitHub (Feb 4, 2019):

Try it now, it should be working on the latest master. Comment back if you're still having issues and I'll reopen the ticket.
