[GH-ISSUE #151] Incorporating Understat as a Data Source #34

Closed
opened 2026-03-02 15:55:13 +03:00 by kerem · 2 comments

Originally created by @Kalaweksh on GitHub (Jan 21, 2023).
Original GitHub issue: https://github.com/probberechts/soccerdata/issues/151

[Understat](https://understat.com/) exposes current and historical data on the big 5 leagues without any apparent restrictions on usage (i.e., it lacks a robots.txt). It has detailed data at both the player and club level, and even an [unofficial API](https://github.com/amosbastian/understat) that could perhaps be integrated as a data source.

@lorenzodb1 commented on GitHub (Jul 10, 2023):

Are there any stats you could get from understat that you couldn't get from FBref (besides the difference in the xG model used)?


@Kalaweksh commented on GitHub (Jul 10, 2023):

> Are there any stats you could get from understat that you couldn't get from FBref (besides the difference in the xG model used)?

I believe xGChain and xGBuildup, for players at least, although I think those could be calculated from the stats FBref exposes.

There are a few more alternative stats for teams, such as xG split by a team's tempo (for example, Dortmund's xG on slow attacks as opposed to their xG on fast attacks) and by shot zone.

Understat is also much faster to scrape and allows users to scrape between arbitrary dates, avoiding unnecessary scraping.
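
For readers unfamiliar with the two player metrics mentioned above, here is a minimal sketch of how xGChain and xGBuildup are commonly described: xGChain sums the xG of every possession a player is involved in, and xGBuildup does the same but ignores involvement that consists only of key passes and shots. The possession data, player names, action labels, and the `chain_metrics` helper are all hypothetical and exist only for illustration. This is neither Understat's nor FBref's actual implementation or data format.

```python
from collections import defaultdict

# Hypothetical possession chains (made-up data): each possession lists
# (player, action) pairs and the xG of the shot that ended it
# (0.0 if the possession produced no shot).
possessions = [
    {"actions": [("Ann", "pass"), ("Bea", "pass"), ("Cy", "key_pass"), ("Dee", "shot")], "xg": 0.30},
    {"actions": [("Bea", "pass"), ("Dee", "key_pass"), ("Cy", "shot")], "xg": 0.10},
    {"actions": [("Ann", "pass"), ("Bea", "pass")], "xg": 0.0},
]


def chain_metrics(possessions):
    """Return (xg_chain, xg_buildup) dicts keyed by player name."""
    xg_chain = defaultdict(float)
    xg_buildup = defaultdict(float)
    for possession in possessions:
        # Every player who touched the ball in this possession.
        involved = {player for player, _ in possession["actions"]}
        # Players with at least one action that is neither a key pass nor a shot
        # (assumption: one such action is enough to earn buildup credit).
        buildup = {
            player
            for player, kind in possession["actions"]
            if kind not in ("key_pass", "shot")
        }
        for player in involved:
            xg_chain[player] += possession["xg"]
        for player in buildup:
            xg_buildup[player] += possession["xg"]
    return dict(xg_chain), dict(xg_buildup)


xg_chain, xg_buildup = chain_metrics(possessions)
for player in sorted(xg_chain):
    print(player, round(xg_chain[player], 2), round(xg_buildup.get(player, 0.0), 2))
```

In this toy data, Cy and Dee only ever contribute key passes and shots, so they accumulate xGChain but no xGBuildup. Reproducing the metrics from another source would need possession-level event data of roughly this shape.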
