[PR #449] [MERGED] A round of fixes #1169

Closed
opened 2026-03-01 14:48:43 +03:00 by kerem · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/449
Author: @pirate
Created: 8/18/2020
Status: Merged
Merged: 8/18/2020
Merged by: @pirate

Base: masterHead: dev


📝 Commits (10+)

  • a682a9c make all parsers accept arbitrary meta kwargs
  • 15efb2d new generic_html parser for extracting hrefs
  • b0c0a67 re-enable readability and singlefile by default now that its less noisy
  • f18d925 wip attempt to fix timestamp unique constraint errors
  • c9b3bab fix pull title not working
  • 6087e30 dont allow ui editing of db fields for now
  • 273588c change main link to point to link details index to reduce confusion
  • 430d517 show archive dir size in link details page
  • e29bfba change iframe panel size and shape on details index
  • 61ab952 fix parser docstring

📊 Changes

19 files changed (+219 additions, -136 deletions)

View changed files

📝 archivebox.egg-info/SOURCES.txt (+1 -0)
📝 archivebox/config/__init__.py (+2 -2)
📝 archivebox/core/admin.py (+5 -5)
📝 archivebox/extractors/__init__.py (+2 -2)
📝 archivebox/index/__init__.py (+23 -18)
📝 archivebox/index/html.py (+2 -0)
📝 archivebox/index/sql.py (+4 -0)
📝 archivebox/main.py (+3 -3)
📝 archivebox/parsers/__init__.py (+48 -35)
archivebox/parsers/generic_html.py (+53 -0)
📝 archivebox/parsers/generic_json.py (+1 -1)
📝 archivebox/parsers/generic_rss.py (+1 -1)
📝 archivebox/parsers/generic_txt.py (+1 -1)
📝 archivebox/parsers/medium_rss.py (+1 -1)
📝 archivebox/parsers/netscape_html.py (+1 -1)
📝 archivebox/parsers/pinboard_rss.py (+1 -1)
📝 archivebox/parsers/pocket_html.py (+1 -1)
📝 archivebox/parsers/shaarli_rss.py (+1 -1)
📝 archivebox/themes/legacy/link_details.html (+68 -63)

📄 Description

No description provided


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ArchiveBox/ArchiveBox/pull/449 **Author:** [@pirate](https://github.com/pirate) **Created:** 8/18/2020 **Status:** ✅ Merged **Merged:** 8/18/2020 **Merged by:** [@pirate](https://github.com/pirate) **Base:** `master` ← **Head:** `dev` --- ### 📝 Commits (10+) - [`a682a9c`](https://github.com/ArchiveBox/ArchiveBox/commit/a682a9c478ce71becbb08c7ee14fbf0d7840243b) make all parsers accept arbitrary meta kwargs - [`15efb2d`](https://github.com/ArchiveBox/ArchiveBox/commit/15efb2d5ed1163fb5f6388646fb167efa7dd1afa) new generic_html parser for extracting hrefs - [`b0c0a67`](https://github.com/ArchiveBox/ArchiveBox/commit/b0c0a676f8255218b66cb6d7553f5bf97a2fa9ed) re-enable readability and singlefile by default now that its less noisy - [`f18d925`](https://github.com/ArchiveBox/ArchiveBox/commit/f18d92570e4d4876098a2761c0a5dcfb9c6eb198) wip attempt to fix timestamp unique constraint errors - [`c9b3bab`](https://github.com/ArchiveBox/ArchiveBox/commit/c9b3bab84d41aa8e789436130c8cf1f3df06e4b8) fix pull title not working - [`6087e30`](https://github.com/ArchiveBox/ArchiveBox/commit/6087e30d38d060599325312f0e009eaf3e282324) dont allow ui editing of db fields for now - [`273588c`](https://github.com/ArchiveBox/ArchiveBox/commit/273588c75a8e8c30e6ff1eefd9d04dc486e7001c) change main link to point to link details index to reduce confusion - [`430d517`](https://github.com/ArchiveBox/ArchiveBox/commit/430d51756b861e93540dcc837c591e35386ef1c5) show archive dir size in link details page - [`e29bfba`](https://github.com/ArchiveBox/ArchiveBox/commit/e29bfba0531845efd135028002dfae35e92ebb24) change iframe panel size and shape on details index - [`61ab952`](https://github.com/ArchiveBox/ArchiveBox/commit/61ab952dab4c74fe29aee267ed8ea540fb0fe94f) fix parser docstring ### 📊 Changes **19 files changed** (+219 additions, -136 deletions) <details> <summary>View changed files</summary> 📝 `archivebox.egg-info/SOURCES.txt` (+1 -0) 📝 `archivebox/config/__init__.py` (+2 -2) 📝 `archivebox/core/admin.py` (+5 -5) 📝 `archivebox/extractors/__init__.py` (+2 -2) 📝 `archivebox/index/__init__.py` (+23 -18) 📝 `archivebox/index/html.py` (+2 -0) 📝 `archivebox/index/sql.py` (+4 -0) 📝 `archivebox/main.py` (+3 -3) 📝 `archivebox/parsers/__init__.py` (+48 -35) ➕ `archivebox/parsers/generic_html.py` (+53 -0) 📝 `archivebox/parsers/generic_json.py` (+1 -1) 📝 `archivebox/parsers/generic_rss.py` (+1 -1) 📝 `archivebox/parsers/generic_txt.py` (+1 -1) 📝 `archivebox/parsers/medium_rss.py` (+1 -1) 📝 `archivebox/parsers/netscape_html.py` (+1 -1) 📝 `archivebox/parsers/pinboard_rss.py` (+1 -1) 📝 `archivebox/parsers/pocket_html.py` (+1 -1) 📝 `archivebox/parsers/shaarli_rss.py` (+1 -1) 📝 `archivebox/themes/legacy/link_details.html` (+68 -63) </details> ### 📄 Description _No description provided_ --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
kerem 2026-03-01 14:48:43 +03:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#1169
No description provided.