[GH-ISSUE #712] 403 forbidden request forbidden by administrative rules #350

Closed
opened 2026-02-25 23:34:01 +03:00 by kerem · 8 comments
Owner

Originally created by @Tataukee on GitHub (Sep 3, 2023).
Original GitHub issue: https://github.com/go-shiori/shiori/issues/712

Unable to save [this article](https://actu.fr/normandie/rouen_76540/rouen-des-familles-au-tribunal-pour-defendre-l-ecole-a-la-maison-on-nous-taxe-de-separatistes_59971016.html): the link either displays the error message in the title of this issue or opens the original source page.
The article saves without problems in other read-later / bookmarking apps.

kerem closed this issue and added the tag:stale label on 2026-02-25 23:34:01 +03:00.

@Monirzadeh commented on GitHub (Sep 3, 2023):

More context: [go-readability](https://github.com/go-shiori/go-readability) itself can parse that page without problems.

@Tataukee commented on GitHub (Sep 3, 2023):

Thank you.
I'm using a Portainer stack to deploy Shiori.
Do you have any idea how to deploy/integrate [go-readability](https://github.com/go-shiori/go-readability)?

@Monirzadeh commented on GitHub (Sep 3, 2023):

The problem is on the Shiori side (Shiori uses go-readability to get the readable version).
If you want to test that, clone the repository, update `example/main.go` with the URL you want, then run `go run example/main.go`.
I am not sure, but something must differ between the requests sent by `go-shiori` and `go-readability` (maybe a header), and the website detects that and returns 403.
Do you have the same problem with any other website?

@Tataukee commented on GitHub (Sep 3, 2023):

Yes, I do have a similar problem with [another site](https://www.names.org/n/huset/about), this time getting the error: Please enable JS and disable any ad blocker.
I do not have any ad blocker.

@Monirzadeh commented on GitHub (Sep 3, 2023):

This one is a little different. The first site only blocks the request when it is sent from Shiori,
but the second fails because the site detects you as a robot (because of your IP, or because it can't fingerprint you); it detects my normal browser as a robot too. `Please enable JS and disable any ad blocker` is part of the page that asks you to solve a CAPTCHA, and `go-readability` extracts that text.

@fmartingr if I remember correctly, you changed the request headers in one of the recent commits. Is it possible that caused this?

@fmartingr commented on GitHub (Sep 9, 2023):

The first article is failing because of the unrecognised user agent we send when downloading a page:

```
$ curl --user-agent "Shiori/2.0.0 (+https://github.com/go-shiori/shiori)" https://actu.fr/normandie/rouen_76540/rouen-des-familles-au-tribunal-pour-defendre-l-ecole-a-la-maison-on-nous-taxe-de-separatistes_59971016.html
<html><body><h1>403 Forbidden</h1>
Request forbidden by administrative rules.
</body></html>
```

The second one is probably failing because we do not use a browser for archiving, so JavaScript is not executed when archiving. That page may have something in place to detect JS in the browser or something like that.

This raises the old issue of... should we use a browser to create archives? I have, like with everything, ideas floating around in my head, but nothing specific and nothing on the roadmap at the moment.

The idea of allowing plugins into Shiori is starting to make more sense to me, but it would be an enormous effort to put through, considering there are only two active contributors at the moment.

@Monirzadeh commented on GitHub (Sep 9, 2023):

About the second link: I get that in my regular browser too.
I think it is still too soon to add them. Maybe the first step is to make them easier to debug.

@stale[bot] commented on GitHub (Oct 10, 2023):

This issue has been automatically marked as stale because it has not had any activity for quite some time.
It will be closed if no further activity occurs.
Thank you for your contributions.

Reference
starred/shiori#350