mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 17:16:00 +03:00
[GH-ISSUE #1055] Bug: SingleFile was not able to archive the page #658
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#658
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @nickali on GitHub (Nov 28, 2022).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1055
Describe the bug
Using the latest stable docker image for arm64, no matter what site I try to archive, SingleFile always gives me the same error: SingleFile was not able to archive the page. All the other options seem to work fine.
Steps to reproduce
Screenshots or log output
Log after submission
Log
This is probably easier to read:
In the logs, in the CMD STR, I see this:
CMD STR
Log
Again, if easier to read
If I pull up a shell in the docker instance and run the above command, I get:
If I remove the user-agent from that command, I get 'Unexpected end of JSON.'
I tried setting the CHROME_USER_AGENT to
under archivebox -> environment in docker-compose.yml, I get the same 'Unexpected end of JSON" error.
ArchiveBox version
docker-compose was left untouched (except for testing out the user agent change once).
Running Ubuntu 22.04.1
Docker version: Docker version 20.10.21, build baeda1f
@pirate commented on GitHub (Nov 29, 2022):
Can you try running the most basic case outside of docker to confirm singlefile works on that URL:
Also verify that the exit 0 doesn't indicate it actually succeeded (but shows the error message anyway) by checkiing for
./data/archive/<timestamp>/singlefile.htmlin one of the failed snapshot output folders. Ifsinglefile.htmlis present and valid then it's just a bug in the error output / output parsing and not actually in singlefile.@nickali commented on GitHub (Nov 30, 2022):
The /data/archive directory has the json file and the index.html with the archive types, but no HTML file with contents of the website.
I should have been more clear, the betanews site is not the only one causing issues, but any site I tried.
I installed single-file outside of docker using npm and tried the following:
The betanews.html contained the html as expected.