mirror of
https://github.com/go-shiori/shiori.git
synced 2026-04-25 06:25:54 +03:00
[GH-ISSUE #1122] Unable to save large article #465
Originally created by @Australis86 on GitHub (Jun 27, 2025).
Original GitHub issue: https://github.com/go-shiori/shiori/issues/1122
Data
Describe the bug / actual behavior
I am unable to cache and archive this article via either the web interface (running on my LAN) or the addon (connected to the instance on my LAN): https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html
It is a relatively large page, which I suspect is the issue (I have already had to set the client_max_body_size quite high in nginx to avoid a 413 error). This is now the log output showing shiori failing to handle the article:
Jun 28 00:29:30 HOSTNAME shiori[24639]: time="2025-06-28T00:29:30+10:00" level=error msg="error downloading bookmark cache" bookmark_id=1743 error="failed to parse article: failed to parse input: transform: short internal buffer" url="https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html"
Jun 28 00:29:34 HOSTNAME shiori[24639]: time="2025-06-28T00:29:34+10:00" level=error msg="Error #01: write tcp 127.0.0.1:53001->127.0.0.1:37170: write: broken pipe\n" clientIP=PC_IP dataLength=3941 hostname=HOSTNAME latency=65140 method=PUT path=/api/v1/bookmarks/cache referer="https://RASPERRY_PI_IP/shiori/?search=-tag%3Aread%20tag%3Amysteries" statusCode=200 userAgent="Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:140.0) Gecko/20100101 Firefox/140.0"
In the shiori web UI, I get a "504 Gateway Timeout" message back from nginx. When using the Firefox addon, no error message is returned.
If I remove the buffer restrictions in nginx and increase the proxy read/write timeouts, then try to update the article in the web UI, I get only this in the log:
Jun 28 01:34:24 HOSTNAME shiori[31550]: time="2025-06-28T01:34:24+10:00" level=error msg="error downloading bookmark cache" bookmark_id=1743 error="failed to process article: context deadline exceeded (Client.Timeout or context cancellation while reading body)" url="https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html"
The web UI then reports that it couldn't update the bookmark.
If I use the addon, it fails silently - no error in the log but no update to the bookmark either.
Expected behavior