[GH-ISSUE #1122] Unable to save large article #465

Open
opened 2026-02-25 23:34:17 +03:00 by kerem · 0 comments
Owner

Originally created by @Australis86 on GitHub (Jun 27, 2025).
Original GitHub issue: https://github.com/go-shiori/shiori/issues/1122

Data

  • Shiori version: 1.7.4
  • Database Engine: SQLite
  • Operating system: Raspbian Bullseye
  • CLI/Web interface/Web Extension: shiori binary running as systemd service (on port 53001) with nginx as reverse proxy to provide SSL

Describe the bug / actual behavior

I am unable to cache and archive this article via either the web interface (running on my LAN) or the addon (connected to the instance on my LAN): https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html

It is a relatively large page, which I suspect is the issue (I have already had to set the client_max_body_size quite high in nginx to avoid a 413 error). This is now the log output showing shiori failing to handle the article:

Jun 28 00:29:30 HOSTNAME shiori[24639]: time="2025-06-28T00:29:30+10:00" level=error msg="error downloading bookmark cache" bookmark_id=1743 error="failed to parse article: failed to parse input: transform: short internal buffer" url="https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html" Jun 28 00:29:34 HOSTNAME shiori[24639]: time="2025-06-28T00:29:34+10:00" level=error msg="Error #01: write tcp 127.0.0.1:53001->127.0.0.1:37170: write: broken pipe\n" clientIP=PC_IP dataLength=3941 hostname=HOSTNAME latency=65140 method=PUT path=/api/v1/bookmarks/cache referer="https://RASPERRY_PI_IP/shiori/?search=-tag%3Aread%20tag%3Amysteries" statusCode=200 userAgent="Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:140.0) Gecko/20100101 Firefox/140.0"

In the shiori web UI, I get a "504 Gateway Timeout" message back from nginx. When using the Firefox addon, no error message is returned.

If I remove the buffer restrictions in nginx and beef up the proxy read/write times, then try to update the article in the web UI, I get only this in the log:

Jun 28 01:34:24 HOSTNAME shiori[31550]: time="2025-06-28T01:34:24+10:00" level=error msg="error downloading bookmark cache" bookmark_id=1743 error="failed to process article: context deadline exceeded (Client.Timeout or context cancellation while reading body)" url="https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html"

The web UI then reports that it couldn't update the bookmark.

If I use the addon, it fails silently - no error in the log but no update to the bookmark either.

Expected behavior

  • Shiori should either report that the article is too large to parse or successfully archive it.
Originally created by @Australis86 on GitHub (Jun 27, 2025). Original GitHub issue: https://github.com/go-shiori/shiori/issues/1122 ## Data - **Shiori version**: 1.7.4 - **Database Engine**: SQLite - **Operating system**: Raspbian Bullseye - **CLI/Web interface/Web Extension**: shiori binary running as systemd service (on port 53001) with nginx as reverse proxy to provide SSL ## Describe the bug / actual behavior I am unable to cache and archive this article via either the web interface (running on my LAN) or the addon (connected to the instance on my LAN): https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html It is a relatively large page, which I suspect is the issue (I have already had to set the client_max_body_size quite high in nginx to avoid a 413 error). This is now the log output showing shiori failing to handle the article: `Jun 28 00:29:30 HOSTNAME shiori[24639]: time="2025-06-28T00:29:30+10:00" level=error msg="error downloading bookmark cache" bookmark_id=1743 error="failed to parse article: failed to parse input: transform: short internal buffer" url="https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html" Jun 28 00:29:34 HOSTNAME shiori[24639]: time="2025-06-28T00:29:34+10:00" level=error msg="Error #01: write tcp 127.0.0.1:53001->127.0.0.1:37170: write: broken pipe\n" clientIP=PC_IP dataLength=3941 hostname=HOSTNAME latency=65140 method=PUT path=/api/v1/bookmarks/cache referer="https://RASPERRY_PI_IP/shiori/?search=-tag%3Aread%20tag%3Amysteries" statusCode=200 userAgent="Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:140.0) Gecko/20100101 Firefox/140.0"` In the shiori web UI, I get a "504 Gateway Timeout" message back from nginx. When using the Firefox addon, no error message is returned. If I remove the buffer restrictions in nginx and beef up the proxy read/write times, then try to update the article in the web UI, I get only this in the log: `Jun 28 01:34:24 HOSTNAME shiori[31550]: time="2025-06-28T01:34:24+10:00" level=error msg="error downloading bookmark cache" bookmark_id=1743 error="failed to process article: context deadline exceeded (Client.Timeout or context cancellation while reading body)" url="https://nymag.com/intelligencer/article/mh370-search-debris-barnacles.html" ` The web UI then reports that it couldn't update the bookmark. If I use the addon, it fails silently - no error in the log but no update to the bookmark either. ## Expected behavior - Shiori should either report that the article is too large to parse or successfully archive it.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/shiori#465
No description provided.