Kaggle dataset
- The main dataset was downloaded from Kaggle. It combines data from transfermarkt.de and fbref.com for the 2018/19 season, merged by Rafael Stepien. It contains 2100 observations and 400 variables describing the 2018/19 season for soccer players from teams competing in the top five European leagues (La Liga, Premier League, Serie A, Bundesliga and Ligue 1).
Country names
- Country names were acquired from GitHub. This dataset includes each country's name and its abbreviation, which allowed us to merge it with the main dataset on the abbreviation using the inner_join() function in RStudio.
World coordinates
- Map coordinates (latitude and longitude) for each country were downloaded from opendatasoft. This dataset was merged with the main dataset on the country names, again using the inner_join() function in R.
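
The two merges described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual script: it assumes inner_join() comes from the dplyr package, and the data frames, column names (nationality, code, country, latitude, longitude) and values are hypothetical toy stand-ins for the real files.

```r
library(dplyr)

# Toy stand-ins for the three datasets. All names and values are
# hypothetical and only illustrate the shape of the merge.
players <- data.frame(
  player      = c("Player A", "Player B", "Player C"),
  nationality = c("ESP", "FRA", "XXX")   # "XXX" has no match in the lookup
)
countries <- data.frame(
  code    = c("ESP", "FRA", "DEU"),
  country = c("Spain", "France", "Germany")
)
coords <- data.frame(
  country   = c("Spain", "France", "Germany"),
  latitude  = c(40.4, 46.2, 51.2),       # rough illustrative values
  longitude = c(-3.7, 2.2, 10.4)
)

# Step 1: attach full country names via the abbreviation.
# Step 2: attach coordinates via the country name.
merged <- players %>%
  inner_join(countries, by = c("nationality" = "code")) %>%
  inner_join(coords, by = "country")

# inner_join() keeps only rows with a match in both tables, so it is
# worth listing the rows that would be dropped.
unmatched <- anti_join(players, countries, by = c("nationality" = "code"))
```

Because inner_join() drops unmatched rows without any warning, comparing nrow(merged) with nrow(players), or inspecting unmatched, shows whether any players were lost to spelling or abbreviation mismatches between the sources.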