[PR #680] v0.6.0 Release #2770

Closed
opened 2026-03-01 18:00:42 +03:00 by kerem · 0 comments

Original Pull Request: https://github.com/ArchiveBox/ArchiveBox/pull/680

State: closed
Merged: Yes


new features:

  • new ArchiveResult admin UI, with full editing ability of individual extractor outputs + list of outputs under each Snapshot admin entry
  • add init --quick and server --quick-init options to quickly update the db version without doing a full re-init (for users with large archive collections this will make version upgrades a lot faster / less painful)
  • new SNAPSHOTS_PER_PAGE=40 and MEDIA_MAX_SIZE=750m config options
  • log all errors with full tracebacks to new data/logs/errors.log file (so users no longer have to run in --debug mode to see error details)
  • allow hotlinking directly to specific extractor output on the snapshot detail page using URL #hash e.g. /archive/<timestamp>/index.html#git
  • add ability to view the snapshot matching a given URL by visiting /archive/https://example.com/some/url -> redirects to -> /archive/<timestamp>/index.html (also works without the scheme, e.g. /archive/example.com)
  • add Django Debug Toolbar + djdt_flamegraph for developers to profile UI performance
  • new archivebox add --tag=tag1,tag2,tag3 ... option and Add page UI form field for tagging new links
  • new archivebox schedule --overwrite flag option that works similarly to add --overwrite
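
The URL-based lookup described above (/archive/<url> redirecting to /archive/<timestamp>/index.html) can be sketched roughly as follows. This is only an illustration under stated assumptions, not ArchiveBox's actual view code: the SNAPSHOTS dict, strip_scheme, and resolve_archive_path are hypothetical names, and the real implementation queries the Django database rather than an in-memory mapping.

```python
import re
from typing import Optional

# Hypothetical in-memory index: snapshot URL -> timestamp directory name.
# (Assumption for illustration; ArchiveBox stores snapshots in its database.)
SNAPSHOTS = {
    "https://example.com/some/url": "1611111111",
    "http://example.com": "1622222222",
}


def strip_scheme(url: str) -> str:
    """Drop a leading http:// or https:// and any trailing slash."""
    return re.sub(r"^https?://", "", url, flags=re.IGNORECASE).rstrip("/")


def resolve_archive_path(path: str) -> Optional[str]:
    """Map /archive/<url> to /archive/<timestamp>/index.html.

    Returns None when the path is not under /archive/ or no snapshot matches.
    """
    prefix = "/archive/"
    if not path.startswith(prefix):
        return None
    # Compare scheme-less forms so both /archive/https://example.com
    # and /archive/example.com find the same snapshot.
    wanted = strip_scheme(path[len(prefix):])
    for url, timestamp in SNAPSHOTS.items():
        if strip_scheme(url) == wanted:
            return f"/archive/{timestamp}/index.html"
    return None
```

Comparing scheme-less forms is one simple way to make the "also works without the scheme" case fall out of the same lookup; a real implementation may match differently.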

enhancements:

  • lots of performance improvements! (in testing with 50k entries, the main index was brought down from 10-14 second load times to 114ms once cache warms up)
  • speed up main codebase hotspots snapshot_icons, latest_title, archive_size using django cache
  • integrity and correctness improvements to readability, mercury, warc, and other extractors
  • lots of small UI improvements and CLI improvements
  • better archivebox schedule logging and changed logfile location to ./logs/schedule.log
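
The "once cache warms up" behaviour mentioned above follows the usual get-or-compute caching pattern. The sketch below uses a plain in-process dict as a stand-in for Django's cache so that it runs without Django installed; get_or_set here is a hypothetical helper loosely modelled on Django's cache.get_or_set, and slow_archive_size is a made-up placeholder, not the project's archive_size code.

```python
import time
from typing import Any, Callable, Dict, Tuple

# Stand-in for a cache backend: key -> (expiry time, cached value).
_CACHE: Dict[str, Tuple[float, Any]] = {}

# Counts how often the expensive function actually runs (for demonstration).
compute_calls = {"archive_size": 0}


def get_or_set(key: str, compute: Callable[[], Any], timeout: float = 60.0) -> Any:
    """Return the cached value for key, computing and storing it on a miss."""
    now = time.monotonic()
    hit = _CACHE.get(key)
    if hit is not None and hit[0] > now:
        return hit[1]  # warm cache: skip the expensive computation
    value = compute()
    _CACHE[key] = (now + timeout, value)
    return value


def slow_archive_size(snapshot_id: str) -> int:
    """Placeholder for an expensive computation such as summing file sizes."""
    compute_calls["archive_size"] += 1
    return len(snapshot_id) * 1000


def cached_archive_size(snapshot_id: str) -> int:
    """Cached wrapper: only the first call per snapshot pays the full cost."""
    return get_or_set(
        f"archive_size:{snapshot_id}",
        lambda: slow_archive_size(snapshot_id),
    )
```

The first call for a given snapshot is as slow as before; later calls within the timeout are served from the cache, which is why the quoted load times apply only after warm-up.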

bugfixes:

  • fix stdin/stdout/stderr handling for some edge cases in Docker/Docker-Compose
  • lots of minor template fixes
  • fix UTF-8 encoding problems with file reading/writing on Windows (supporting a Python pkg on Windows is unreasonably painful y'all)