[GH-ISSUE #1133] Substack crawling usually doesn't work - 403 errors #740

Open
opened 2026-03-02 11:52:20 +03:00 by kerem · 2 comments
Owner

Originally created by @patrickbolle on GitHub (Mar 17, 2025).
Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/1133

Describe the Bug

Love Hoarder! I am trying to use it to store some Substack articles, but they seem to fail at random. Trying to recrawl them just shows a 403 error.

Here's an example of one that 403s: https://joyarbitrage.substack.com/p/find-your-2ers

Not sure how to debug further. Is Hoarder trying to scrape the page / load the data via my server, and that server is being blocked by Substack potentially?

Here are the server logs:

2025-03-17T19:39:03.832Z info: [Crawler][799] Will crawl "https://joyarbitrage.substack.com/p/find-your-2ers" for link with id "pzw7ps12d8asic0ym92phb5d"

2025-03-17T19:39:03.832Z info: [Crawler][799] Attempting to determine the content-type for the url https://joyarbitrage.substack.com/p/find-your-2ers

2025-03-17T19:39:03.894Z info: [Crawler][799] Content-type for the url https://joyarbitrage.substack.com/p/find-your-2ers is "text/html; charset=utf-8"

2025-03-17T19:39:04.170Z info: [Crawler][799] Successfully navigated to "https://joyarbitrage.substack.com/p/find-your-2ers". Waiting for the page to load ...

2025-03-17T19:39:05.849Z info: [Crawler][799] Finished waiting for the page to load.

2025-03-17T19:39:05.852Z info: [Crawler][799] Successfully fetched the page content.

2025-03-17T19:39:06.069Z info: [Crawler][799] Finished capturing page content and a screenshot. FullPageScreenshot: false

2025-03-17T19:39:06.073Z info: [Crawler][799] Will attempt to extract metadata from page ...

2025-03-17T19:39:06.465Z info: [Crawler][799] Will attempt to extract readable content ...

2025-03-17T19:39:06.781Z info: [Crawler][799] Done extracting readable content.

2025-03-17T19:39:06.788Z info: [Crawler][799] Done extracting metadata from the page.

2025-03-17T19:39:06.851Z info: [Crawler][799] Stored the screenshot as assetId: 07e18c23-2172-42e3-b787-58d82574e12b

2025-03-17T19:39:06.851Z info: [Crawler][799] Downloading image from "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAKkAAADACAYAAAB/LkO9AAAMFElEQVR4Aeydc7QzTRLG67ONzPQkV59t27btSXfP2rZt27Zt27Zt79bWM3fPPafPq+x7k0xlUn/8XmPqqd/NdA2JH3/FpXznvY4SSBmGcRT8JH7cFR/m0PkLh95j2XcLgRrGMAr4CC/hJ/ETrvkI93dkrhxzKH7MsXcnjnttLtCYMYzN4R88hI/wEn5C0g9zKT/p58xBqGV1n5M/fBkzrSPQiDGMdeAbvIN/8BA+wkv4mUoKQHSLhOKd//Td4wUaEYZxPDxbcq52cABJE1lD/k8O3RdznN1DIMMYEnvAK/iVyjm4pCn4CPbud/8OxUP4tvMdgdYSw+jAI/jEVSLnMiUFXrhVvV79Lofurbl0mwpkGAMBX+AN/IFHPvFrOJKCdLgqPsOxe6FAhrEGLoQvyVAERiZpsl6teRPH2SMFSjAMeAE/QjIUjVHSdL36dw7dZ3G/u4tAxtSzC3yAF+m6sylJgV+S9dccigdyuev2AhlTx/bof+1Bla47m5Z0RVmD+yaH7s186503EshoPRuh3+h7KqdGSVccrj7KceYcgYzWcg76nA5F+iVNhytfy/oGjr1DBDJawyF1X30yFE2WpAmLS4C/CE9iPzcn0MRizKGP6GcyFE26pOlwlf+MY3FPvt3c1gJNDMbW6Bv6l6w7WyXpisPV1zj0buDyoA0EUouxAfqEfqVytlTShLC0Zn0/VzOnCKQO45S6PzEZiqZH0nS4yv/DsXi5sL9AjWOgDy9HX5KhaGolTYerP3LsPp7jTj2Bxo7RQ/7owwpDkUm6wnr1pxLWXeQg8ZYCGSNnS+SN3JN1p0k60GnWL3DoXsn3u9+6Ag0dY13ki5wHl9MkTQmOOYLiPRxnTxBoWBjIE7k65JzmbpKu9XD1Lw7FS2Xi3EsgY63ZCzkiz2EORSZpMlzlv+fYfTjHuUyggTEy5Ib8RjkUmaTpcPV9DsXt+Jp9NxNolRibISfkNa51p0m64pVWn+cwc4lAxgpcgnzSK5RM0vEThVDL+haOM0cLZEgOyAO5xOZ7ZJKucBtL8TyuFnYVaArZFfWnt22YpFrXq7/h0HuwHKTeQaApYAfUi7qTdadJOgmy5t/mUHi+fm5jgVrIxqgPdaZymqSTOFx9nKveeQK1iPNQl+qhyCRdi2cExOJN7LuHCjTBHIo6Br+X3SSdxOHqrxyKp3CYWRBogljAdmP7GxqKTNIGhqtfcOjeh2/qbiuQYrbFdmJ703WnSTpFshbf5Ni9iS/Za0OBFLEhtgvbl8ppkk4fYWnN+kGR4nSBFHD64vZoHIpM0uZvYwnFq/5RzR4oUAMciP9/9bdtmKTG4nr1T1x1n8TlwoxAI+dWu81z1XvK4v9rcpqk/9/JgJ9y6N55ZLexhH22ETnvIXL+fJnrTpPUZHVfYj9zDb/ikvUEWja4l72avZFj8VWT0yQd5nAFlv+MgDh/qsg5yqHIJLXhyv3vGQFz+ws0OPjzvZezd2McikxSG67izCDPCOjhz9lQZJI2eTLgJ1ytOFzVP69m74zft3WnSapF1i/yrRau5Cfe6PA9fm5ymqQ6h6tbL/zGhqI2S+pTJjPcbFL3CAkmKfBCdMyVEIV6o/I/cd/9bpH8D+zxXtOlP2O7zlFkHwTkjLyRO6j7kPaH/TRJGpae+PwX7mcfkVAeKVzBZecwvrkzz7fquZroetwv9uMyO4u9uxv77PXcx1Oi1/piXwO5AeSIPJEr8kXOyBu5A/QB/UBf0B/0Cf2qpW6rpMlDc/OvyI/vwTHfg4nWEWhgfGdHDu5K7udvZ5//22QdXE7khdyQH3IUaGDQJ/QLfUP/ImRtjaTJFfHfksJKjjtsLtCy8dnxEtjbBgjMhjnkJHkJtFzQP/QR/WzHix1Czb+luCfyrbMdBBoq+Ar32fW4ECQNzFi6QAb5pHusoYB+oq/orzBhkqY3v/2KQ3aJQCPllmJXCeqDJmoi6AeRi0AjBf1Fn6ObEEkTQfPv8S3ZIQKNhVtvuyX7/FXSIBMUOUgeAo0F9Bn9jk67pImgP+Cys49AYwUPS/D5K9GoKRb0lY08DAP9Rt+jUy6pF/D8y1vckQI1wo3bbyHbgkvhpm6CR92oX6BGQN/Rf69Z0oCg3E0CNUroLLB3P+EwPVM86kXdAjUK+h+cUkmxqymzVwikg86VaVjtBXWiXoFUAA8qp0xSX/Mb+Xd2EkgFOOzSz9/M0U3Dbv7NqFcgDcAD+CAokDT5FM0fKpAqcGrPu3+0WVLUhzoFUgV8qJwSSb3Qd7/lm/JZgdRRuje19dMUdaE+gdQBH+CFb1TS5FP0xQKpxOcXtFlS1CeQSuBF5RRIWgvgzhNIJddvvbVs5484tG+iR12oTyCVwIvYtKS+/v6XyVU1GulnL2/bAX7Ug7oEUgu8qP1oRNJkd/M+gVRTutu0UVLUJZBq4Ed0TUiaXIL3RIFUU2YntHF3j7oEUg38qBqXNL+9QKrpd3eR7f0r+/bc/oF6UJdAmoEfzUu6eIsBaQbXPmJt1DJJf4m6BNIM/GhW0lifCr1EINVUxXayvT9vmaQ/R10CqQZ+RJPUJDVJ17gmvUYg1cQdMtneX7dM0l+jLoE0Az8UrEnzuwukmluyvbjfrnP4qAd1CaQa+KFgcHqOQKrBveXBte7yPNQlkGbgR/ODUz//FN+P1hVILT6/ZxsP5qMugdQCL+BHdI2fFv0TnnghkFrK/O1tu8gE9aAugdQCL+CHV3AVVHLLiDZuLrqyneklY+0Znn6L+gRSCbxQcxVUmb9T05XhCWUW0qDaA+pCfQKpAz7Ai6hBUuDd3znkBwmkipI2wJqorfc6oS7UhzoFUgV8gBeqbh/p5y8RSBfZxS0VNBEVdQqkCfig4vaRhFAfhzxWjaDXdDbjvvvCNEiKOlGvQCqAB4s+6JE0ORylJawye8C0PMkEdaJegRoH/U8PO2mRNNntP7b5M0zuZA7ub+yn58nNqBd1C9Qo6H/l1D9m5z/y47IxQYPbjb37Ybqbbz+oF3WjfoGaAH1H/1U/Zif5qu53rhBorNyqM88+/zJHN81PdP4ychBorKDf6d5LraSJqOP9RN1xP/b5V6dW0FTUryIPgcZDXqaCapc0EbX+8aNHPkzhmkWf/zwR1ET9OXIZw5D0aPQ5EXRiJE3fw/kxviU7TqChglOCIX+6BPQfDiZnQhCQC/IZxalT9BN9rVxLXuwQ3eIzi0L+/KGcmSq3z+vrFL378RpCMiQf5IS8kJtAywL9Qx/Rz+ha9oocvxQY1i5v4TK7jqt8VqCBCDPbcN+dwj5/mvDT9J1CxoDvzvop8kOOyFOgQUCf0C/0Df1L3pnaJklXeBsbwNvX8CKrEsHld2CfXSq/dk5N6c6XX/MSysPZ52+WH/8QYQ8jIHtZb87IE7kiX+SMvJE7QB/QD/QF/UGflnrmp+3dov9t7y7BvNjiOA5PL+tuaW+6DkQ20h/cHdYiWvCIW6Ti7u7WE4mK9AeHH2dwdznhDW/768ynnvm+OR9Y6ntHv8nGTK99/pGCSEGkb0KkIFJECiIFkSJSECkinXhFpGQbaeqziFWjDkVPQ0RPE2SmIco+i5gzqDF6mwancruyAqnLss8y0qzB97wZRAoiRaQgUkQKIgWRIlIQKYgUkYJIESmIFESKSEGkUMTsckGitfMrMKPljx/Ndf2I1GcRa8Zsie76u9HddA/ykrpMfRaxcerV6K5zfJb8lF2mPlOkUy7FzLpI5UJeyi5TnyJFpCBSRAoiBZEiUhApIhUpIgWRIlIQKYgUkYJIyZ1IEenUyyIl20hTn0Wsn3Aq+uoj+pohM/VR9lnEsq7O6GkaFt2Nw/kSLT9RI28qu0x9lpFC1r7nzSBSECkiBZEiUhApiBSRgkhBpIgURIpIQaQgUkQKIoUi5v9VFX0t/0ZP8398qY6foJl3pS7LPotYO2Z3dNc/ju6mJ5CX1GXq8/lso3P35Gim2UY8ZgdEikhBpCBSRAoiJWciBZEiUhApIgWRgkgRKYgURIrZRvhhs43rJlyIGZURM2r4rNqIadURU6t+vPS55ee7xm+qjLLPIjb1DYkdC8/GziWXYgeftvhCHFhxLY5tvB3HNtz8cTbeikNrrqd7cNE1fmFnkros+3wKY1ZelSrvOhIAAAAASUVORK5CYII="

2025-03-17T19:39:06.852Z info: [Crawler][799] Downloaded image as assetId: 85e7a266-1a69-43e4-9c7b-71b3f9fccb0b

2025-03-17T19:39:07.814Z info: [Crawler][799] Completed successfully

2025-03-17T19:39:08.095Z info: [VideoCrawler][802] Skipping video download from "https://joyarbitrage.substack.com/p/find-your-2ers", because it is disabled in the config.

2025-03-17T19:39:08.096Z info: [VideoCrawler][802] Video Download Completed successfully

2025-03-17T19:39:08.258Z info: [search][801] Attempting to index bookmark with id pzw7ps12d8asic0ym92phb5d ...

2025-03-17T19:39:08.358Z info: [inference][800] Starting an inference job for bookmark with id "pzw7ps12d8asic0ym92phb5d"

2025-03-17T19:39:08.611Z info: [webhook][803] Starting a webhook job for bookmark with id "pzw7ps12d8asic0ym92phb5d"

2025-03-17T19:39:08.611Z info: [webhook][803] Completed successfully

2025-03-17T19:39:08.715Z info: [inference][800] Inferring tag for bookmark "pzw7ps12d8asic0ym92phb5d" used 242 tokens and inferred:

2025-03-17T19:39:08.772Z info: [inference][800] Completed successfully

2025-03-17T19:39:09.382Z info: [search][801] Completed successfully

2025-03-17T19:39:09.549Z info: [search][804] Attempting to index bookmark with id pzw7ps12d8asic0ym92phb5d ...

2025-03-17T19:39:10.589Z info: [search][804] Completed successfully

Steps to Reproduce

  1. Go to https://joyarbitrage.substack.com/p/find-your-2ers
  2. Use Hoarder extension or otherwise to input into Hoarder
  3. See if it loads data or not.
  4. Check Broken Links page for 403 error.

Expected Behaviour

Hoarder should load the data properly for all Substack pages.

Screenshots or Additional Context

Image
Image

Device Details

Firefox on Linux.

Exact Hoarder Version

0.22.0

Have you checked the troubleshooting guide?

  • I have checked the troubleshooting guide and I haven't found a solution to my problem
Originally created by @patrickbolle on GitHub (Mar 17, 2025). Original GitHub issue: https://github.com/karakeep-app/karakeep/issues/1133 ### Describe the Bug Love Hoarder! I am trying to use it to store some Substack articles, but they seem to fail at random. Trying to recrawl them just shows a 403 error. Here's an example of one that 403s: https://joyarbitrage.substack.com/p/find-your-2ers Not sure how to debug further. Is Hoarder trying to scrape the page / load the data via my server, and that server is being blocked by Substack potentially? Here are the server logs: ``` 2025-03-17T19:39:03.832Z info: [Crawler][799] Will crawl "https://joyarbitrage.substack.com/p/find-your-2ers" for link with id "pzw7ps12d8asic0ym92phb5d" 2025-03-17T19:39:03.832Z info: [Crawler][799] Attempting to determine the content-type for the url https://joyarbitrage.substack.com/p/find-your-2ers 2025-03-17T19:39:03.894Z info: [Crawler][799] Content-type for the url https://joyarbitrage.substack.com/p/find-your-2ers is "text/html; charset=utf-8" 2025-03-17T19:39:04.170Z info: [Crawler][799] Successfully navigated to "https://joyarbitrage.substack.com/p/find-your-2ers". Waiting for the page to load ... 2025-03-17T19:39:05.849Z info: [Crawler][799] Finished waiting for the page to load. 2025-03-17T19:39:05.852Z info: [Crawler][799] Successfully fetched the page content. 2025-03-17T19:39:06.069Z info: [Crawler][799] Finished capturing page content and a screenshot. FullPageScreenshot: false 2025-03-17T19:39:06.073Z info: [Crawler][799] Will attempt to extract metadata from page ... 2025-03-17T19:39:06.465Z info: [Crawler][799] Will attempt to extract readable content ... 2025-03-17T19:39:06.781Z info: [Crawler][799] Done extracting readable content. 2025-03-17T19:39:06.788Z info: [Crawler][799] Done extracting metadata from the page. 2025-03-17T19:39:06.851Z info: [Crawler][799] Stored the screenshot as assetId: 07e18c23-2172-42e3-b787-58d82574e12b 2025-03-17T19:39:06.851Z info: [Crawler][799] Downloading image from "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAKkAAADACAYAAAB/LkO9AAAMFElEQVR4Aeydc7QzTRLG67ONzPQkV59t27btSXfP2rZt27Zt27Zt79bWM3fPPafPq+x7k0xlUn/8XmPqqd/NdA2JH3/FpXznvY4SSBmGcRT8JH7cFR/m0PkLh95j2XcLgRrGMAr4CC/hJ/ETrvkI93dkrhxzKH7MsXcnjnttLtCYMYzN4R88hI/wEn5C0g9zKT/p58xBqGV1n5M/fBkzrSPQiDGMdeAbvIN/8BA+wkv4mUoKQHSLhOKd//Td4wUaEYZxPDxbcq52cABJE1lD/k8O3RdznN1DIMMYEnvAK/iVyjm4pCn4CPbud/8OxUP4tvMdgdYSw+jAI/jEVSLnMiUFXrhVvV79Lofurbl0mwpkGAMBX+AN/IFHPvFrOJKCdLgqPsOxe6FAhrEGLoQvyVAERiZpsl6teRPH2SMFSjAMeAE/QjIUjVHSdL36dw7dZ3G/u4tAxtSzC3yAF+m6sylJgV+S9dccigdyuev2AhlTx/bof+1Bla47m5Z0RVmD+yaH7s186503EshoPRuh3+h7KqdGSVccrj7KceYcgYzWcg76nA5F+iVNhytfy/oGjr1DBDJawyF1X30yFE2WpAmLS4C/CE9iPzcn0MRizKGP6GcyFE26pOlwlf+MY3FPvt3c1gJNDMbW6Bv6l6w7WyXpisPV1zj0buDyoA0EUouxAfqEfqVytlTShLC0Zn0/VzOnCKQO45S6PzEZiqZH0nS4yv/DsXi5sL9AjWOgDy9HX5KhaGolTYerP3LsPp7jTj2Bxo7RQ/7owwpDkUm6wnr1pxLWXeQg8ZYCGSNnS+SN3JN1p0k60GnWL3DoXsn3u9+6Ag0dY13ki5wHl9MkTQmOOYLiPRxnTxBoWBjIE7k65JzmbpKu9XD1Lw7FS2Xi3EsgY63ZCzkiz2EORSZpMlzlv+fYfTjHuUyggTEy5Ib8RjkUmaTpcPV9DsXt+Jp9NxNolRibISfkNa51p0m64pVWn+cwc4lAxgpcgnzSK5RM0vEThVDL+haOM0cLZEgOyAO5xOZ7ZJKucBtL8TyuFnYVaArZFfWnt22YpFrXq7/h0HuwHKTeQaApYAfUi7qTdadJOgmy5t/mUHi+fm5jgVrIxqgPdaZymqSTOFx9nKveeQK1iPNQl+qhyCRdi2cExOJN7LuHCjTBHIo6Br+X3SSdxOHqrxyKp3CYWRBogljAdmP7GxqKTNIGhqtfcOjeh2/qbiuQYrbFdmJ703WnSTpFshbf5Ni9iS/Za0OBFLEhtgvbl8ppkk4fYWnN+kGR4nSBFHD64vZoHIpM0uZvYwnFq/5RzR4oUAMciP9/9bdtmKTG4nr1T1x1n8TlwoxAI+dWu81z1XvK4v9rcpqk/9/JgJ9y6N55ZLexhH22ETnvIXL+fJnrTpPUZHVfYj9zDb/ikvUEWja4l72avZFj8VWT0yQd5nAFlv+MgDh/qsg5yqHIJLXhyv3vGQFz+ws0OPjzvZezd2McikxSG67izCDPCOjhz9lQZJI2eTLgJ1ytOFzVP69m74zft3WnSapF1i/yrRau5Cfe6PA9fm5ymqQ6h6tbL/zGhqI2S+pTJjPcbFL3CAkmKfBCdMyVEIV6o/I/cd/9bpH8D+zxXtOlP2O7zlFkHwTkjLyRO6j7kPaH/TRJGpae+PwX7mcfkVAeKVzBZecwvrkzz7fquZroetwv9uMyO4u9uxv77PXcx1Oi1/piXwO5AeSIPJEr8kXOyBu5A/QB/UBf0B/0Cf2qpW6rpMlDc/OvyI/vwTHfg4nWEWhgfGdHDu5K7udvZ5//22QdXE7khdyQH3IUaGDQJ/QLfUP/ImRtjaTJFfHfksJKjjtsLtCy8dnxEtjbBgjMhjnkJHkJtFzQP/QR/WzHix1Czb+luCfyrbMdBBoq+Ar32fW4ECQNzFi6QAb5pHusoYB+oq/orzBhkqY3v/2KQ3aJQCPllmJXCeqDJmoi6AeRi0AjBf1Fn6ObEEkTQfPv8S3ZIQKNhVtvuyX7/FXSIBMUOUgeAo0F9Bn9jk67pImgP+Cys49AYwUPS/D5K9GoKRb0lY08DAP9Rt+jUy6pF/D8y1vckQI1wo3bbyHbgkvhpm6CR92oX6BGQN/Rf69Z0oCg3E0CNUroLLB3P+EwPVM86kXdAjUK+h+cUkmxqymzVwikg86VaVjtBXWiXoFUAA8qp0xSX/Mb+Xd2EkgFOOzSz9/M0U3Dbv7NqFcgDcAD+CAokDT5FM0fKpAqcGrPu3+0WVLUhzoFUgV8qJwSSb3Qd7/lm/JZgdRRuje19dMUdaE+gdQBH+CFb1TS5FP0xQKpxOcXtFlS1CeQSuBF5RRIWgvgzhNIJddvvbVs5484tG+iR12oTyCVwIvYtKS+/v6XyVU1GulnL2/bAX7Ug7oEUgu8qP1oRNJkd/M+gVRTutu0UVLUJZBq4Ed0TUiaXIL3RIFUU2YntHF3j7oEUg38qBqXNL+9QKrpd3eR7f0r+/bc/oF6UJdAmoEfzUu6eIsBaQbXPmJt1DJJf4m6BNIM/GhW0lifCr1EINVUxXayvT9vmaQ/R10CqQZ+RJPUJDVJ17gmvUYg1cQdMtneX7dM0l+jLoE0Az8UrEnzuwukmluyvbjfrnP4qAd1CaQa+KFgcHqOQKrBveXBte7yPNQlkGbgR/ODUz//FN+P1hVILT6/ZxsP5qMugdQCL+BHdI2fFv0TnnghkFrK/O1tu8gE9aAugdQCL+CHV3AVVHLLiDZuLrqyneklY+0Znn6L+gRSCbxQcxVUmb9T05XhCWUW0qDaA+pCfQKpAz7Ai6hBUuDd3znkBwmkipI2wJqorfc6oS7UhzoFUgV8gBeqbh/p5y8RSBfZxS0VNBEVdQqkCfig4vaRhFAfhzxWjaDXdDbjvvvCNEiKOlGvQCqAB4s+6JE0ORylJawye8C0PMkEdaJegRoH/U8PO2mRNNntP7b5M0zuZA7ub+yn58nNqBd1C9Qo6H/l1D9m5z/y47IxQYPbjb37Ybqbbz+oF3WjfoGaAH1H/1U/Zif5qu53rhBorNyqM88+/zJHN81PdP4ychBorKDf6d5LraSJqOP9RN1xP/b5V6dW0FTUryIPgcZDXqaCapc0EbX+8aNHPkzhmkWf/zwR1ET9OXIZw5D0aPQ5EXRiJE3fw/kxviU7TqChglOCIX+6BPQfDiZnQhCQC/IZxalT9BN9rVxLXuwQ3eIzi0L+/KGcmSq3z+vrFL378RpCMiQf5IS8kJtAywL9Qx/Rz+ha9oocvxQY1i5v4TK7jqt8VqCBCDPbcN+dwj5/mvDT9J1CxoDvzvop8kOOyFOgQUCf0C/0Df1L3pnaJklXeBsbwNvX8CKrEsHld2CfXSq/dk5N6c6XX/MSysPZ52+WH/8QYQ8jIHtZb87IE7kiX+SMvJE7QB/QD/QF/UGflnrmp+3dov9t7y7BvNjiOA5PL+tuaW+6DkQ20h/cHdYiWvCIW6Ti7u7WE4mK9AeHH2dwdznhDW/768ynnvm+OR9Y6ntHv8nGTK99/pGCSEGkb0KkIFJECiIFkSJSECkinXhFpGQbaeqziFWjDkVPQ0RPE2SmIco+i5gzqDF6mwancruyAqnLss8y0qzB97wZRAoiRaQgUkQKIgWRIlIQKYgUkYJIESmIFESKSEGkUMTsckGitfMrMKPljx/Ndf2I1GcRa8Zsie76u9HddA/ykrpMfRaxcerV6K5zfJb8lF2mPlOkUy7FzLpI5UJeyi5TnyJFpCBSRAoiBZEiUhApIhUpIgWRIlIQKYgUkYJIyZ1IEenUyyIl20hTn0Wsn3Aq+uoj+pohM/VR9lnEsq7O6GkaFt2Nw/kSLT9RI28qu0x9lpFC1r7nzSBSECkiBZEiUhApiBSRgkhBpIgURIpIQaQgUkQKIoUi5v9VFX0t/0ZP8398qY6foJl3pS7LPotYO2Z3dNc/ju6mJ5CX1GXq8/lso3P35Gim2UY8ZgdEikhBpCBSRAoiJWciBZEiUhApIgWRgkgRKYgURIrZRvhhs43rJlyIGZURM2r4rNqIadURU6t+vPS55ee7xm+qjLLPIjb1DYkdC8/GziWXYgeftvhCHFhxLY5tvB3HNtz8cTbeikNrrqd7cNE1fmFnkros+3wKY1ZelSrvOhIAAAAASUVORK5CYII=" 2025-03-17T19:39:06.852Z info: [Crawler][799] Downloaded image as assetId: 85e7a266-1a69-43e4-9c7b-71b3f9fccb0b 2025-03-17T19:39:07.814Z info: [Crawler][799] Completed successfully 2025-03-17T19:39:08.095Z info: [VideoCrawler][802] Skipping video download from "https://joyarbitrage.substack.com/p/find-your-2ers", because it is disabled in the config. 2025-03-17T19:39:08.096Z info: [VideoCrawler][802] Video Download Completed successfully 2025-03-17T19:39:08.258Z info: [search][801] Attempting to index bookmark with id pzw7ps12d8asic0ym92phb5d ... 2025-03-17T19:39:08.358Z info: [inference][800] Starting an inference job for bookmark with id "pzw7ps12d8asic0ym92phb5d" 2025-03-17T19:39:08.611Z info: [webhook][803] Starting a webhook job for bookmark with id "pzw7ps12d8asic0ym92phb5d" 2025-03-17T19:39:08.611Z info: [webhook][803] Completed successfully 2025-03-17T19:39:08.715Z info: [inference][800] Inferring tag for bookmark "pzw7ps12d8asic0ym92phb5d" used 242 tokens and inferred: 2025-03-17T19:39:08.772Z info: [inference][800] Completed successfully 2025-03-17T19:39:09.382Z info: [search][801] Completed successfully 2025-03-17T19:39:09.549Z info: [search][804] Attempting to index bookmark with id pzw7ps12d8asic0ym92phb5d ... 2025-03-17T19:39:10.589Z info: [search][804] Completed successfully ``` ### Steps to Reproduce 1. Go to https://joyarbitrage.substack.com/p/find-your-2ers 2. Use Hoarder extension or otherwise to input into Hoarder 3. See if it loads data or not. 4. Check Broken Links page for 403 error. ### Expected Behaviour Hoarder should load the data properly for all Substack pages. ### Screenshots or Additional Context ![Image](https://github.com/user-attachments/assets/bb293db5-07f6-4e8c-a0af-51640a79072e) ![Image](https://github.com/user-attachments/assets/75644705-e1b9-427e-b7ee-e4f8af5d37e5) ### Device Details Firefox on Linux. ### Exact Hoarder Version 0.22.0 ### Have you checked the troubleshooting guide? - [x] I have checked the troubleshooting guide and I haven't found a solution to my problem
Author
Owner

@vince-p commented on GitHub (Dec 7, 2025):

I have this same problem.
Substack articles never save.
Any advice on how to fix?

<!-- gh-comment-id:3621872200 --> @vince-p commented on GitHub (Dec 7, 2025): I have this same problem. Substack articles never save. Any advice on how to fix?
Author
Owner

@thomaslazar commented on GitHub (Dec 26, 2025):

Encountered this issue today trying to hoard https://dollardhingra.substack.com/p/questions-software-engineers-should

<!-- gh-comment-id:3692871136 --> @thomaslazar commented on GitHub (Dec 26, 2025): Encountered this issue today trying to hoard https://dollardhingra.substack.com/p/questions-software-engineers-should
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/karakeep#740
No description provided.