mirror of
https://github.com/probberechts/soccerdata.git
synced 2026-04-25 18:15:58 +03:00
[GH-ISSUE #252] Data differents from WhoScored #51
Labels
No labels
ESPN
FBref
FotMob
MatchHistory
SoFIFA
Sofascore
WhoScored
WhoScored
bug
build
common
dependencies
discussion
documentation
duplicate
enhancement
good first issue
invalid
performance
pull-request
question
question
removal
understat
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/soccerdata#51
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @REM4125 on GitHub (May 24, 2023).
Original GitHub issue: https://github.com/probberechts/soccerdata/issues/252
Hello,
Thank you for this great package!
Two small questions:
@probberechts commented on GitHub (May 25, 2023):
Opta data is available in many formats (JSON and XML) and levels of detail (F24, MA3, ...). The data stream that can be scraped from WhoScored is based on Opta data, but WhoScored uses its own JSON format and level of detail. It mostly contains the same information as F24 streams but there are some minor differences.
The output of the WhoScored scraper can be directly used in socceraction. The two libraries are perfectly compatible. If you look at the schema for Opta data in socceraction you'll notice that the "keypass" column is optional. This is one difference between the F24 streams and the WhoScored JSON format. The WhoScored JSON data does not have a "keypass" attribute at the root level, but you can probably derive it from the list of qualifiers of each pass event.
I can't check the difference between the output of soccerdata and ScraperFC, as the WhoScored module was removed from ScraperFC. Nevertheless, I assume it is just a matter of how you massage the JSON data to get a dataframe and maybe some updates in the data stream itself. With soccerdata you can get the raw output JSON using: