mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 17:16:00 +03:00
[GH-ISSUE #674] Failed to archive link: UnicodeEncodeError: 'gbk' codec can't encode character '\u25be' in position 9443: illegal multibyte sequence #3444
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#3444
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @littlegolden on GitHub (Mar 26, 2021).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/674
similar with #32, but it happens when archiving the site:
@pirate commented on GitHub (Mar 26, 2021):
I think this is a different issue, can you share what the URL is that you tried to archive?
Also please post the full output of
archivebox --version.@pirate commented on GitHub (Mar 27, 2021):
I think it's due to windows not defaulting to UTF-8 for file writes.
There's a PEP to fix it, but it's not proposed to land until 3.10: https://discuss.python.org/t/pep-597-enable-utf-8-mode-by-default-on-windows/3122
In the meantime can you try setting the
PYTHONLEGACYWINDOWSSTDIO=utf-8environment variable and running it again.Related issue: https://github.com/ArchiveBox/ArchiveBox/issues/678
In the meantime I've added a patch to v0.6 that should fix this issue: 71e632a
You can try it out like so:
Post back if you're still encountering the problem and I'll reopen the ticket.
I've also added some fixes to v0.6 that should improve the situation.