[GH-ISSUE #123] Integrate scylla with proxy for web scraping #94

Closed
opened 2026-03-02 23:33:34 +03:00 by kerem · 1 comment
Owner

Originally created by @AJaySi on GitHub (Sep 11, 2024).
Original GitHub issue: https://github.com/AJaySi/ALwrity/issues/123

Originally assigned to: @AJaySi on GitHub.

Scylla: A free, open-source proxy pool for scraping.

Originally created by @AJaySi on GitHub (Sep 11, 2024). Original GitHub issue: https://github.com/AJaySi/ALwrity/issues/123 Originally assigned to: @AJaySi on GitHub. [Scylla](https://github.com/imWildCat/scylla): A free, open-source proxy pool for scraping.
kerem closed this issue 2026-03-02 23:33:34 +03:00
Author
Owner

@AJaySi commented on GitHub (Mar 13, 2025):

I did some research and ALwrity already has firecrawl, Exa and Tavily AI doing the same. There are free mostly and very generous in their free tier. Its better to use them then cook up our own. There are problems of getting banned etc with scraping, which ALwrity does not want concentrate on, right now.
As and when above, web researching API become paid/expensive, we will have to consider such libraries.

Closing for now.

<!-- gh-comment-id:2720041283 --> @AJaySi commented on GitHub (Mar 13, 2025): I did some research and ALwrity already has firecrawl, Exa and Tavily AI doing the same. There are free mostly and very generous in their free tier. Its better to use them then cook up our own. There are problems of getting banned etc with scraping, which ALwrity does not want concentrate on, right now. As and when above, web researching API become paid/expensive, we will have to consider such libraries. Closing for now.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ALwrity#94
No description provided.