mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 09:06:02 +03:00
[GH-ISSUE #1510] Bug: Attempting to removed failed "Archive again" result relating to a pre 0.8.3 snapshot results in Archivebox attempting to delete EVERY entry in the database?! #3910
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#3910
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @jessienab on GitHub (Sep 6, 2024).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1510
Describe the bug
Following up to: #1509
It seems ArchiveBox did eventually generate the "Archive again" entries for pre-0.8.3 snapshots, however it didn't archive them properly. When attempting to delete these, the following happened:
I immediately killed ArchiveBox to prevent further damage, but at this point I'll have to restore from an older backup + manually re-grab a possibly large number of URLs for sites that weren't archived in that backup... 😮💨
My fault! 🤦♀️
Steps to reproduce
Screenshots or log output
See above
ArchiveBox version
@pirate commented on GitHub (Sep 6, 2024):
A shit, looks like some bug in the form parsing for the submit action selected all the snapshots?!
I'll investigate immediately, sorry about messing up your archive. I have several intergration tests that should prevent this type of thing around the CLI commands, but this shows I need to improve them to cover more of the UI button actions.
@jessienab commented on GitHub (Sep 9, 2024):
No worries!! My fault not having functioning backups :)
I managed to grab an older DB (3 months out of date), compiled all the URLs from sources/ up to now, and am just regrabbing. Seems no website data was deleted? so at least worst case if a website is missing now in the archive index, at least the older archived data is still present on disk (I can grep around to find it 👍 )
Thanks again and I guess lesson for me to make a backup (as you indicated and I did not read hehe) before running betas!!!
@pirate commented on GitHub (Sep 9, 2024):
If the older data is still present on disk running
archivebox initshould also re-import it, as it will scan thearchive/folder for snapshot entries not in the DB and re-create them from thearchive/<id>/index.jsonfile saved with each snapshot output.@jessienab commented on GitHub (Sep 10, 2024):
Luck had it that I had setup rsnapshot, and I found the backup it made the day before I nuked ArchiveBox; everything restored! yay :D