Olympic Dataset 1
• Abstract
▪ Data Collection
▪ Data Preprocessing
• Handling Missing Data: Rows with missing values, particularly for key
columns like Age, Height, and Medal, were either removed or imputed.
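The removal and imputation step could look roughly like the sketch below. The report does not say which rows were dropped or how values were imputed, so everything here is an assumption: the sample rows are hypothetical, rows missing more than one numeric value are dropped, remaining gaps in Age and Height are filled with the column median, and a missing Medal is read as "No Medal". Only the Python standard library is used.

```python
from statistics import median

# Hypothetical sample rows; only the column names come from the report.
rows = [
    {"Name": "A", "Age": 23, "Height": 180.0, "Medal": "Gold"},
    {"Name": "B", "Age": None, "Height": 170.0, "Medal": None},
    {"Name": "C", "Age": 31, "Height": None, "Medal": "Bronze"},
    {"Name": "D", "Age": None, "Height": None, "Medal": None},
]


def clean(rows, numeric_cols=("Age", "Height"), max_missing=1):
    """Drop sparse rows, then impute what is left (assumed strategy)."""
    # Removal: discard rows missing more than `max_missing` numeric values.
    kept = [
        r for r in rows
        if sum(r[c] is None for c in numeric_cols) <= max_missing
    ]

    # Imputation: per-column median computed from the observed values only.
    medians = {
        c: median(r[c] for r in kept if r[c] is not None)
        for c in numeric_cols
    }

    cleaned = []
    for r in kept:
        new = dict(r)  # copy so the input rows are left untouched
        for c in numeric_cols:
            if new[c] is None:
                new[c] = medians[c]
        # Assumption: a missing Medal means the athlete did not win one.
        if new["Medal"] is None:
            new["Medal"] = "No Medal"
        cleaned.append(new)
    return cleaned


cleaned = clean(rows)
for r in cleaned:
    print(r)
```

The median is used here rather than the mean because it is less sensitive to outliers in Age and Height. Whether dropping or imputing is appropriate depends on how much data is missing in each column.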
• Predictive Modeling
Country-Level Analysis
• Top Performing Countries: Countries such as the USA, Russia, and China
consistently rank near the top of the medal counts, likely because of
stronger infrastructure, funding, and sports training programs.
• Economic Influence: Nations with higher GDPs tend to field more athletes
across a wider range of sports, and they generally perform better across
the board. Countries with lower GDPs may send fewer athletes but sometimes
excel in niche sports where they have specialized training programs.
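A country-level medal count of the kind discussed above can be sketched as follows. The records, country codes, and field layout are hypothetical placeholders, not values from the dataset. The sketch also assumes that each row represents one athlete in one event, so a team medal appears once per team member and has to be deduplicated before counting.

```python
from collections import Counter

# Hypothetical records: (country code, games, event, athlete, medal).
# Country codes are placeholders, not real results.
records = [
    ("AAA", "Games 1", "Team Event", "P1", "Gold"),
    ("AAA", "Games 1", "Team Event", "P2", "Gold"),  # same team medal
    ("AAA", "Games 1", "Individual Event X", "P3", "Silver"),
    ("BBB", "Games 1", "Individual Event Y", "P4", "Gold"),
    ("BBB", "Games 1", "Individual Event X", "P5", None),  # no medal
    ("CCC", "Games 1", "Individual Event Z", "P6", "Gold"),
]


def medal_table(records):
    """Count medals per country, counting each team medal only once."""
    # A set of (country, games, event, medal) collapses the duplicate rows
    # that team events produce, and the filter skips rows without a medal.
    unique_medals = {
        (country, games, event, medal)
        for country, games, event, _athlete, medal in records
        if medal is not None
    }
    return Counter(country for country, _g, _e, _m in unique_medals)


table = medal_table(records)
for country, count in table.most_common():
    print(country, count)
```

Counting raw rows instead would inflate the totals of countries that are strong in team sports, which can distort comparisons between nations. Relating these counts to GDP would require joining an external economic dataset, which is not shown here.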
• Conclusion
The exploratory data analysis of the Olympic dataset revealed several
insights into participation trends and the factors associated with Olympic
success. The number of athletes and participating countries has grown over
the years, with a marked rise in the number of female athletes. Demographic
factors such as age, height, and weight were associated with an athlete's
likelihood of winning a medal: younger athletes excelled in sports like
gymnastics, while older athletes performed better in endurance events.
Countries with higher GDPs tend to perform better, plausibly because they
have more resources for training and athlete development. The analysis also
showed that a small number of countries dominate the medal counts, while
others struggle to secure medals. The inclusion of new sports like surfing
and skateboarding reflects changing global interests. However, the analysis
is limited by missing data and by the lack of more detailed information
about athletes. Future research could incorporate additional data to
provide a more comprehensive understanding of Olympic performance.
• Future Work