mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-27 10:06:00 +03:00
[GH-ISSUE #920] Question: Snapshot exists in DB, but resource /singlefile.html does not exist in snapshot dir yet. #2082
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#2082
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @2600box on GitHub (Jan 31, 2022).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/920
I have just set up archive box on ubuntu 20.04 with the script. No errors in packages and all looks good.
When I archive a site, I can see different results, but the singlefile, chrome and git have this messages:
I can see in the log errors like:
Not Found: /archive/1643631596.804901/singlefile.htmlIf I run the suggested command, it does not fix the issue and this is the result:
Is this a configuration issue? What am I doing wrong?
Thanks!
@pirate commented on GitHub (Mar 16, 2022):
Please post the full output of
archivebox --version.@rickcecil commented on GitHub (Aug 29, 2022):
So I am experiencing this error as well. The full output of "archivebox --version" is below.
Any help would be greatly appreciated.
Some additional details: I am importing new links via text file imported through CLI. There are Between 1000-2500 links per text file. The logs indicate that everything that all download attempts have been successful.
From the UI, the system indicates that it is successful, but "size" is blank.
If I pull these items again, it works successfully.
It is difficult to determine how many items have failed. but it does appear to be around a thousand or more.
Let me know if you need any additional information.
Right now, my solution is to go back through page-by-page and re-pull the pages. It is somewhat tedious, but not too bad.
It would be great if you had a) additional troubleshooting tips; b) and instructions (if possible) on how to repull the items that are missing data (I investigated this option, but did not find anything. I will look more later. Sometimes I miss things that are right in front of me.)
@vcudachi commented on GitHub (Aug 27, 2024):
Almost the same issue for me. I have added a link to archive the web page. Got the result: ✅ 4 ❌ 9 and title "Pending...". Then nothing happens, and the logs are empty. The message is:
Next, I ran the command archivebox update -t timestamp 1724776314.904378 in the terminal and archiving completed successfully.
Is this normal behavior?
The output of version command:
@pirate commented on GitHub (Sep 6, 2024):
This should be fixed in the latest v0.8.3-rc pre-release, adding via the UI now starts the job in a background thread so it shouldnt appear "stalled" like you saw.
https://github.com/ArchiveBox/ArchiveBox/releases/tag/v0.8.3-rc
Give that a try and let me know if you still encounter any issues.