[PR #552] [MERGED] v0.5.0 Release #4236

Closed
opened 2026-03-15 01:33:39 +03:00 by kerem · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/552
Author: @pirate
Created: 11/28/2020
Status: Merged
Merged: 1/6/2021
Merged by: @pirate

Base: masterHead: v0.5.0


📝 Commits (10+)

  • f292cfa fix: Add condition for oneshot when archiving links
  • d064a3e fix: Handle case when update tries to re-add a link that is not in the sql index
  • 33182fd fix: Add missing assignation
  • 7165522 feat: Add warc to list and limit check to succeeded archive results
  • 508a0bb refactor: Unpack extractors tuple instead of using the index to access the relevant information
  • f7f0beb feat: Modify migration reverse function to restore index (WIP)
  • b237e41 feat: Finish reversal. Add ArchiveResults that are not found in the index.json
  • e594e6a feat: WARC link points to the first warc result in target path
  • 8cfad64 feat: Add specific logic for archive_org icon
  • c565fad feat: Use prefetch related to reduce the number of queries to the database on public index view

📊 Changes

106 files changed (+2131 additions, -1844 deletions)

View changed files

📝 .dockerignore (+6 -0)
.github/workflows/debian.yml (+76 -0)
📝 .github/workflows/docker.yml (+16 -1)
.github/workflows/homebrew.yml (+50 -0)
.github/workflows/lint.yml (+34 -0)
.github/workflows/pip.yml (+61 -0)
📝 .github/workflows/test.yml (+22 -34)
📝 .gitignore (+2 -1)
📝 .gitmodules (+24 -2)
📝 Dockerfile (+16 -8)
📝 README.md (+171 -132)
archivebox.egg-info/PKG-INFO (+0 -541)
archivebox.egg-info/SOURCES.txt (+0 -128)
archivebox.egg-info/dependency_links.txt (+0 -1)
archivebox.egg-info/entry_points.txt (+0 -3)
archivebox.egg-info/requires.txt (+0 -25)
archivebox.egg-info/top_level.txt (+0 -1)
archivebox/base32_crockford.py (+0 -172)
📝 archivebox/cli/__init__.py (+8 -0)
📝 archivebox/cli/archivebox_add.py (+1 -1)

...and 80 more files

📄 Description

  • ArchiveResult moved to SQLite3 DB for performance @cdvv7788
  • lots of assorted bugfixes and improvements courtesy of @cdvv7788 and @jdcaballerov
  • new full-text search support with ripgrep and sonic courtesy of @jdcaballerov
  • new archivebox oneshot command for downloading a single site without starting a whole collection
  • new Pocket API importer courtesy of @mAAdhaTTah
  • new Wallabag importer courtesy of @ehainry
  • new extractor options on Add page courtesy of @BlipRanger
  • new apt/deb/homebrew/pip packaging setup into separate repos under new Github Org https://github.com/ArchiveBox
  • new official PPA and Docker Hub accounts https://hub.docker.com/r/archivebox/archivebox (with automatic armv7 builds courtesy of @chrismeller)

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ArchiveBox/ArchiveBox/pull/552 **Author:** [@pirate](https://github.com/pirate) **Created:** 11/28/2020 **Status:** ✅ Merged **Merged:** 1/6/2021 **Merged by:** [@pirate](https://github.com/pirate) **Base:** `master` ← **Head:** `v0.5.0` --- ### 📝 Commits (10+) - [`f292cfa`](https://github.com/ArchiveBox/ArchiveBox/commit/f292cface27e6de0a552d2fc1e78fd99f6aa9219) fix: Add condition for oneshot when archiving links - [`d064a3e`](https://github.com/ArchiveBox/ArchiveBox/commit/d064a3eeffa0a6cb52462ce1f2edb0d6be8f753a) fix: Handle case when update tries to re-add a link that is not in the sql index - [`33182fd`](https://github.com/ArchiveBox/ArchiveBox/commit/33182fd53c0d96f46576ee38551a7ac4a50ee534) fix: Add missing assignation - [`7165522`](https://github.com/ArchiveBox/ArchiveBox/commit/71655220ad8554458978a078e604cb2b57fa2e1c) feat: Add warc to list and limit check to succeeded archive results - [`508a0bb`](https://github.com/ArchiveBox/ArchiveBox/commit/508a0bb06ebd15bcb63407328a5d4747fb10d977) refactor: Unpack extractors tuple instead of using the index to access the relevant information - [`f7f0beb`](https://github.com/ArchiveBox/ArchiveBox/commit/f7f0bebdcc021623a438e7975982523cdbe8bea8) feat: Modify migration reverse function to restore index (WIP) - [`b237e41`](https://github.com/ArchiveBox/ArchiveBox/commit/b237e412df2c63399394a7ad0370096f7cd1009d) feat: Finish reversal. Add ArchiveResults that are not found in the index.json - [`e594e6a`](https://github.com/ArchiveBox/ArchiveBox/commit/e594e6a75a2895077029d97b88d7b6f8b580885f) feat: WARC link points to the first warc result in target path - [`8cfad64`](https://github.com/ArchiveBox/ArchiveBox/commit/8cfad64271cf72ed4572c4d3a2c5ff6885bc8b95) feat: Add specific logic for archive_org icon - [`c565fad`](https://github.com/ArchiveBox/ArchiveBox/commit/c565fad75cf5f6256a0ce70febb7c2246cbd1b42) feat: Use prefetch related to reduce the number of queries to the database on public index view ### 📊 Changes **106 files changed** (+2131 additions, -1844 deletions) <details> <summary>View changed files</summary> 📝 `.dockerignore` (+6 -0) ➕ `.github/workflows/debian.yml` (+76 -0) 📝 `.github/workflows/docker.yml` (+16 -1) ➕ `.github/workflows/homebrew.yml` (+50 -0) ➕ `.github/workflows/lint.yml` (+34 -0) ➕ `.github/workflows/pip.yml` (+61 -0) 📝 `.github/workflows/test.yml` (+22 -34) 📝 `.gitignore` (+2 -1) 📝 `.gitmodules` (+24 -2) 📝 `Dockerfile` (+16 -8) 📝 `README.md` (+171 -132) ➖ `archivebox.egg-info/PKG-INFO` (+0 -541) ➖ `archivebox.egg-info/SOURCES.txt` (+0 -128) ➖ `archivebox.egg-info/dependency_links.txt` (+0 -1) ➖ `archivebox.egg-info/entry_points.txt` (+0 -3) ➖ `archivebox.egg-info/requires.txt` (+0 -25) ➖ `archivebox.egg-info/top_level.txt` (+0 -1) ➖ `archivebox/base32_crockford.py` (+0 -172) 📝 `archivebox/cli/__init__.py` (+8 -0) 📝 `archivebox/cli/archivebox_add.py` (+1 -1) _...and 80 more files_ </details> ### 📄 Description - ArchiveResult moved to SQLite3 DB for performance @cdvv7788 - lots of assorted bugfixes and improvements courtesy of @cdvv7788 and @jdcaballerov - new full-text search support with ripgrep and sonic courtesy of @jdcaballerov - new `archivebox oneshot` command for downloading a single site without starting a whole collection - new Pocket API importer courtesy of @mAAdhaTTah - new Wallabag importer courtesy of @ehainry - new extractor options on Add page courtesy of @BlipRanger - new apt/deb/homebrew/pip packaging setup into separate repos under new Github Org https://github.com/ArchiveBox - new official PPA and Docker Hub accounts https://hub.docker.com/r/archivebox/archivebox (with automatic armv7 builds courtesy of @chrismeller) --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
kerem 2026-03-15 01:33:39 +03:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#4236
No description provided.