[PR #680] [MERGED] v0.6.0 Release #4277

Closed
opened 2026-03-15 01:35:48 +03:00 by kerem · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/680
Author: @pirate
Created: 3/27/2021
Status: Merged
Merged: 3/31/2021
Merged by: @pirate

Base: masterHead: dev


📝 Commits (10+)

  • 90ef5e1 fix dangling small and update apt sources instructions
  • 7b1b1a2 make arm support for apt explicit
  • 9ac1f8c Update README.md
  • 7d62fc2 fix macOS unnecessary version limitation
  • 2b9282e send some love to monadical
  • af09ac0 Update README.md
  • 7e18fb8 Update README.md
  • 8439f3f Update README.md
  • e8069f8 Update README.md
  • 188670e disable sonic by default in docker-compose and add instructions

📊 Changes

74 files changed (+2061 additions, -767 deletions)

View changed files

📝 .github/ISSUE_TEMPLATE/bug_report.md (+2 -2)
📝 .gitmodules (+3 -0)
📝 Dockerfile (+17 -12)
📝 README.md (+59 -29)
📝 archivebox/cli/__init__.py (+5 -1)
📝 archivebox/cli/archivebox_add.py (+12 -1)
📝 archivebox/cli/archivebox_config.py (+4 -1)
📝 archivebox/cli/archivebox_init.py (+6 -0)
📝 archivebox/cli/archivebox_list.py (+11 -11)
📝 archivebox/cli/archivebox_oneshot.py (+4 -1)
📝 archivebox/cli/archivebox_remove.py (+4 -1)
📝 archivebox/cli/archivebox_server.py (+13 -2)
📝 archivebox/cli/archivebox_update.py (+7 -3)
archivebox/cli/tests.py (+227 -0)
📝 archivebox/config.py (+62 -10)
📝 archivebox/core/admin.py (+162 -71)
📝 archivebox/core/forms.py (+2 -1)
archivebox/core/migrations/0009_auto_20210216_1038.py (+18 -0)
archivebox/core/migrations/0010_auto_20210216_1055.py (+18 -0)
archivebox/core/migrations/0011_auto_20210216_1331.py (+24 -0)

...and 54 more files

📄 Description

new features:

  • new ArchiveResult admin UI, with full editing ability of individual extractor outputs + list of outputs under each Snapshot admin entry
  • add init --quick and server --quick-init options to quickly update the db version without doing a full re-init (for users with large archive collections this will make version upgrades a lot faster / less painful)
  • new SNAPSHOTS_PER_PAGE=40 and MEDIA_MAX_SIZE=750m config options
  • log all errors with full tracebacks to new data/logs/errors.log file (so users no longer have to run in --debug mode to see error details)
  • allow hotlinking directly to specific extractor output on the snapshot detail page using URL #hash e.g. /archive/<timestamp>/index.html#git
  • add ability to view snapshot matching a given URLs by visiting /archive/https://example.com/some/url -> redirects to -> /archive/<timestamp>/index.html (also works without scheme /archive/example.com)
  • add Django Debug Toolbar + djdt_flamegraph for developers to profile UI performance
  • new archivebox add --tag=tag1,tag2,tag3 ... option and Add page UI form field for tagging new links
  • new archivebox schedule --overwrite flag option that works similarly to add --overwrite

enhancements:

  • lots of performance improvements! (in testing with 50k entries, the main index was brought down from 10-14 second load times to 114ms once cache warms up)
  • speed up main codebae hotspots snapshot_icons, latest_title, archive_size using django cache
  • integrity and correctness improvements to readability, mercury, warc, and other extractors
  • lots of small UI improvements and CLI improvements
  • better archivebox schedule logging and changed logfile location to ./logs/schedule.log

bugfixes:

  • fix stdin/stdout/stderr handling for some edge cases in Docker/Docker-Compose
  • lots of minor template fixes
  • fix UTF-8 encoding encoding problems with file reading/writing on Windows (supporting a Python pkg on Windows is unreasonably painful ya'll)

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ArchiveBox/ArchiveBox/pull/680 **Author:** [@pirate](https://github.com/pirate) **Created:** 3/27/2021 **Status:** ✅ Merged **Merged:** 3/31/2021 **Merged by:** [@pirate](https://github.com/pirate) **Base:** `master` ← **Head:** `dev` --- ### 📝 Commits (10+) - [`90ef5e1`](https://github.com/ArchiveBox/ArchiveBox/commit/90ef5e14b5b690e655e41e935ae53008fa22f35b) fix dangling small and update apt sources instructions - [`7b1b1a2`](https://github.com/ArchiveBox/ArchiveBox/commit/7b1b1a237496a81225f6ff7796d3c2d09292cf26) make arm support for apt explicit - [`9ac1f8c`](https://github.com/ArchiveBox/ArchiveBox/commit/9ac1f8c5a1bd05b2dcb7fd4acc45848fe9183e17) Update README.md - [`7d62fc2`](https://github.com/ArchiveBox/ArchiveBox/commit/7d62fc23fae7dbedf5f8bfaa67184350c41d30f4) fix macOS unnecessary version limitation - [`2b9282e`](https://github.com/ArchiveBox/ArchiveBox/commit/2b9282e754771f7aa06c6c736e889ed3796bd435) send some love to monadical - [`af09ac0`](https://github.com/ArchiveBox/ArchiveBox/commit/af09ac0e7f5c6993d984c45f587c0f9a72d7b930) Update README.md - [`7e18fb8`](https://github.com/ArchiveBox/ArchiveBox/commit/7e18fb87652cf4a48f13531bcd8e1ac23ff61817) Update README.md - [`8439f3f`](https://github.com/ArchiveBox/ArchiveBox/commit/8439f3f532936537f5916d3a28b761d56576d973) Update README.md - [`e8069f8`](https://github.com/ArchiveBox/ArchiveBox/commit/e8069f8043999dcc9d481c826a4ceb10f76e6bcc) Update README.md - [`188670e`](https://github.com/ArchiveBox/ArchiveBox/commit/188670eb8be643ed7d38d4db32a2d8fe1eb99b4e) disable sonic by default in docker-compose and add instructions ### 📊 Changes **74 files changed** (+2061 additions, -767 deletions) <details> <summary>View changed files</summary> 📝 `.github/ISSUE_TEMPLATE/bug_report.md` (+2 -2) 📝 `.gitmodules` (+3 -0) 📝 `Dockerfile` (+17 -12) 📝 `README.md` (+59 -29) 📝 `archivebox/cli/__init__.py` (+5 -1) 📝 `archivebox/cli/archivebox_add.py` (+12 -1) 📝 `archivebox/cli/archivebox_config.py` (+4 -1) 📝 `archivebox/cli/archivebox_init.py` (+6 -0) 📝 `archivebox/cli/archivebox_list.py` (+11 -11) 📝 `archivebox/cli/archivebox_oneshot.py` (+4 -1) 📝 `archivebox/cli/archivebox_remove.py` (+4 -1) 📝 `archivebox/cli/archivebox_server.py` (+13 -2) 📝 `archivebox/cli/archivebox_update.py` (+7 -3) ➕ `archivebox/cli/tests.py` (+227 -0) 📝 `archivebox/config.py` (+62 -10) 📝 `archivebox/core/admin.py` (+162 -71) 📝 `archivebox/core/forms.py` (+2 -1) ➕ `archivebox/core/migrations/0009_auto_20210216_1038.py` (+18 -0) ➕ `archivebox/core/migrations/0010_auto_20210216_1055.py` (+18 -0) ➕ `archivebox/core/migrations/0011_auto_20210216_1331.py` (+24 -0) _...and 54 more files_ </details> ### 📄 Description new features: - new ArchiveResult admin UI, with full editing ability of individual extractor outputs + list of outputs under each Snapshot admin entry - add `init --quick` and `server --quick-init` options to quickly update the db version without doing a full re-init (for users with large archive collections this will make version upgrades a lot faster / less painful) - new `SNAPSHOTS_PER_PAGE=40` and `MEDIA_MAX_SIZE=750m` config options - log all errors with full tracebacks to new `data/logs/errors.log` file (so users no longer have to run in --debug mode to see error details) - allow hotlinking directly to specific extractor output on the snapshot detail page using URL `#hash` e.g. `/archive/<timestamp>/index.html#git` - add ability to view snapshot matching a given URLs by visiting `/archive/https://example.com/some/url` -> redirects to -> `/archive/<timestamp>/index.html` (also works without scheme `/archive/example.com`) - add Django Debug Toolbar + `djdt_flamegraph` for developers to profile UI performance - new `archivebox add --tag=tag1,tag2,tag3 ...` option and Add page UI form field for tagging new links - new `archivebox schedule --overwrite` flag option that works similarly to `add --overwrite` enhancements: - lots of performance improvements! (in testing with 50k entries, the main index was brought down from 10-14 second load times to 114ms once cache warms up) - speed up main codebae hotspots `snapshot_icons`, `latest_title`, `archive_size` using django cache - integrity and correctness improvements to readability, mercury, warc, and other extractors - lots of small UI improvements and CLI improvements - better `archivebox schedule` logging and changed logfile location to `./logs/schedule.log` bugfixes: - fix stdin/stdout/stderr handling for some edge cases in Docker/Docker-Compose - lots of minor template fixes - fix UTF-8 encoding encoding problems with file reading/writing on Windows (supporting a Python pkg on Windows is unreasonably painful ya'll) --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
kerem 2026-03-15 01:35:48 +03:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ArchiveBox#4277
No description provided.