mirror of
https://github.com/probberechts/soccerdata.git
synced 2026-04-26 02:25:51 +03:00
[GH-ISSUE #219] [SoFIFA] Scraper gets blocked by bot protection service #47
Originally created by @andrzej-konczyk on GitHub (Apr 21, 2023).
Original GitHub issue: https://github.com/probberechts/soccerdata/issues/219
Hi! How can I set up a proxy to get data? I would like to use the read_players() function, but downloading the data fails. I am not sure how to set up the proxy properly, and I assume that may be the cause. The current error is ConnectionError: Could not download https://sofifa.com/.
@probberechts commented on GitHub (Apr 22, 2023):
It looks like SoFIFA has added stronger protection against scraping through Cloudflare, so setting up a proxy will not help. I do not have a quick solution for this. We will probably have to either keep track of some cookies and add them to the request headers, or switch to a Selenium-based scraper to bypass the block.
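The cookie idea mentioned above could be sketched roughly as follows. This is only an illustration using the Python standard library, not code from soccerdata. The helper name `build_request`, the cookie name `cf_clearance`, its value, and the User-Agent string are all placeholder assumptions, and whether replaying such values would satisfy SoFIFA's protection is untested.

```python
from urllib.request import Request


def build_request(url, cookies, user_agent):
    """Build (but do not send) a request carrying previously obtained cookies.

    `cookies` is a dict of cookie name -> value, for example copied from a
    browser session that has already passed the bot check (an assumption).
    """
    # Join the cookies into a single "name=value; name=value" header value.
    cookie_header = "; ".join(f"{name}={value}" for name, value in cookies.items())
    return Request(url, headers={"Cookie": cookie_header, "User-Agent": user_agent})


# Placeholder values only; real ones would have to come from a valid session.
req = build_request(
    "https://sofifa.com/",
    {"cf_clearance": "PLACEHOLDER"},
    "Mozilla/5.0 (placeholder)",
)
print(req.get_header("Cookie"))
```

The request is only constructed here; actually sending it would take `urllib.request.urlopen(req)`, and such cookies are usually tied to the User-Agent that obtained them, so both would probably need to match.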
@probberechts commented on GitHub (Apr 28, 2023):
Based on some limited initial tests, it seems to work with cfscrape.
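For anyone who wants to try this, a minimal sketch of fetching a page with cfscrape might look like the following. It assumes the third-party `cfscrape` package is installed (`pip install cfscrape`); the helper name `fetch_with_cfscrape` and the 30-second timeout are illustrative choices, not part of soccerdata, and there is no guarantee this gets past the current protection.

```python
def fetch_with_cfscrape(url):
    """Fetch a page through a cfscrape session and return its HTML text."""
    # Imported lazily so that defining this helper does not require cfscrape.
    import cfscrape

    # create_scraper() returns an object that behaves like a requests.Session.
    scraper = cfscrape.create_scraper()
    response = scraper.get(url, timeout=30)
    response.raise_for_status()  # raise an error on 4xx/5xx responses
    return response.text


# Example call (performs a real network request):
# html = fetch_with_cfscrape("https://sofifa.com/")
```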