mirror of
https://github.com/probberechts/soccerdata.git
synced 2026-04-25 10:05:53 +03:00
[GH-ISSUE #916] [fbref] Error fetching schedule on 1.8.8 #201
Labels
No labels
ESPN
FBref
FotMob
MatchHistory
SoFIFA
Sofascore
WhoScored
WhoScored
bug
build
common
dependencies
discussion
documentation
duplicate
enhancement
good first issue
invalid
performance
pull-request
question
question
removal
understat
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/soccerdata#201
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @SportlyxLabs on GitHub (Jan 18, 2026).
Original GitHub issue: https://github.com/probberechts/soccerdata/issues/916
Describe the bug
A call to fbref schedule fails with 403 error , even with 1.8.8 version
Affected scrapers
This affects the following scrapers:
Code example
A minimal code example that fails. Use
no_cache=Trueto make sure an invalid cached file does not cause the bug and make sure you have the latest version of soccerdata installed.Error message
Additional context
Add any other context about the problem here.
Contributor Action Plan
@Messe57 commented on GitHub (Jan 19, 2026):
I am facing the same issues. I thought that it was my fault, but it seems that is not the case.
@zshott commented on GitHub (Jan 19, 2026):
I was able to use FBref scraper last week, but now I am also getting 403
Also, your name is in the paths in the error message, maybe you should remove that.
@saedstudent commented on GitHub (Jan 19, 2026):
same here
@spanalytic commented on GitHub (Jan 20, 2026):
Got the same problem is there any fix?
@probberechts commented on GitHub (Jan 20, 2026):
It looks like FBref has recently strengthened its anti-bot protection (they are behind Cloudflare).
I haven’t had the chance to investigate this in depth yet, and I can’t give a timeline for when I'll be able to do so.
As a temporary workaround, one possible solution might be to:
@SportlyxLabs commented on GitHub (Jan 21, 2026):
Looks like fbref advanced data is gone
https://www.sports-reference.com/blog/2026/01/fbref-stathead-data-update/
@lvlun0532-spec commented on GitHub (Jan 21, 2026):
Bro, do you know of any other websites where I can view advanced data?
@lvlun0532-spec commented on GitHub (Jan 21, 2026):
Hey bro, do you know any other websites where I can view advanced data? Like, the kind that updates weekly.
@dimitrismoustakas commented on GitHub (Jan 24, 2026):
The problem is indeed caused by this. There is an relatively easy fix (you just need to be careful to not break WhoScored in the process) but I'm not sure I should commit it because together with FbRef changing what data are available I'm not sure what (if any) tests failing are due to me not making the full proper solution and what tests are failing simply because the data are no longer available. If we only had the schedule problem I'd have committed it already (I only use the schedule method anyway in my project) but not I don't have the time to fully investigate all the issues.
@lvlun0532-spec commented on GitHub (Jan 25, 2026):
Thank you. I think I've found an alternative, although FotMob's data can't
compare to FBRef's. However, it's sufficient for my model. Let's keep in
touch.
dimitrismoustakas @.***> 于2026年1月24日周六 20:36写道:
@guilhermecxe commented on GitHub (Feb 8, 2026):
We can bypass Cloudflare with the
BaseSeleniumReaderusing something like this:However, it uses PyAutoGUI internally and PyAutoGUI doesn't work with
headless=True. I tried to implement this but I am stuck there, making it headless.@thomaSLBY commented on GitHub (Feb 9, 2026):
Hi @guilhermecxe, you can complete the Driver instanciation with headless=True, headless2=True to get the scraping faster and less detectable.
@guilhermecxe commented on GitHub (Feb 13, 2026):
@thomaSLBY, like said previously, PyAutoGUI (used by
uc_gui_click_captcha) doesn't work with headless as True. Or am I missing something?@paulz1 commented on GitHub (Feb 23, 2026):
Are there something new on this ?
Seems that fbref doesn't work even putting the cookies from Firefox. Am I right or may be I do something wrong ?
Are there some other workarounds ?
@gustavoalikan1910 commented on GitHub (Feb 24, 2026):
@paulz1
Same issue here....