This dataset includes 120 years of Olympic history from Athens 1896 to Rio 2016.
The data comes as two CSV files. One holds basic biographical data for each athlete along with the medals they won in their respective sports,
and the other maps NOC codes to regions. The two files are joined on NOC.
This dataset can be used to visualize things such as participants per country and medals won by different countries in the Summer and Winter Olympics from 1896 to 2016.
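The join on NOC described above can be sketched in plain Python. This is only an illustration, not the Tableau setup itself: the sample rows are made up, and the column names (Name, Sex, NOC, Year, Season, Medal, region) are assumptions about how the two files are laid out.

```python
import csv
import io

# Tiny made-up samples standing in for the two CSV files.
# The column names are assumptions, not taken from the write-up.
ATHLETES_CSV = """Name,Sex,NOC,Year,Season,Medal
Athlete A,M,USA,1984,Summer,Gold
Athlete B,F,USA,2010,Winter,Silver
Athlete C,F,NOR,2010,Winter,Gold
Athlete D,M,XXX,1996,Summer,
"""

REGIONS_CSV = """NOC,region
USA,USA
NOR,Norway
"""


def join_on_noc(athletes_text, regions_text):
    """Left-join athlete rows to region names using the NOC code."""
    regions = {
        row["NOC"]: row["region"]
        for row in csv.DictReader(io.StringIO(regions_text))
    }
    joined = []
    for row in csv.DictReader(io.StringIO(athletes_text)):
        # Keep athletes whose NOC has no match and mark the region as unknown.
        row["region"] = regions.get(row["NOC"], "Unknown")
        joined.append(row)
    return joined


rows = join_on_noc(ATHLETES_CSV, REGIONS_CSV)
for row in rows:
    print(row["Name"], row["NOC"], row["region"])
```

A left join is used here so that athlete rows with an unmatched NOC code are kept rather than silently dropped.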
Link to Dataset
The stacked chart above shows the number of participants (Y axis) per country (X axis). The color shows the split between male and female participants. Because the chart is sorted in descending order of participants, we can see that the USA has the most participants in the Olympics so far, at 18,853 (male: 13,320, female: 5,533). The data can also be filtered to show the number of participants for specific Olympic years.
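The counting behind a chart like this can be sketched as follows. The rows are hypothetical, and the sketch assumes that each row in the data counts as one participation entry; whether the chart counts rows or distinct athletes is not stated here.

```python
from collections import Counter

# Hypothetical rows: (country, sex, year). One row = one participation entry.
ENTRIES = [
    ("USA", "M", 1984), ("USA", "F", 1984), ("USA", "M", 2016),
    ("France", "M", 1984), ("France", "F", 2016),
]


def participants_by_country(entries, year=None):
    """Count entries per country, split by sex; optionally keep one year only."""
    counts = {}
    for country, sex, entry_year in entries:
        if year is not None and entry_year != year:
            continue  # the year filter drops rows from other Games
        counts.setdefault(country, Counter())[sex] += 1
    # Sort countries by total participants, largest first, as in the chart.
    return sorted(
        counts.items(),
        key=lambda item: sum(item[1].values()),
        reverse=True,
    )


ranking = participants_by_country(ENTRIES)
for country, by_sex in ranking:
    print(country, by_sex["M"], by_sex["F"], sum(by_sex.values()))
```

Calling `participants_by_country(ENTRIES, year=1984)` mimics the year filter on the chart.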
The line chart above shows the total number of medals (Y axis) won by a country in each Olympic year (X axis). We can see that in the Summer Olympics the USA won 138 medals in 1984, and in the Winter Olympics it won 29 medals in 2010; each is the USA's highest total so far for that season. Thus 1984 was the USA's best year in the Summer Olympics and 2010 was its best year in the Winter Olympics. Similarly, the region filter can be used to get the total number of medals won in each Summer and Winter Olympics by any other country. Separate charts are provided for the Winter and Summer Olympics, and the season filter switches between them.
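The logic of the line chart (filter by country and season, total the medals per year, then read off the peak) can be sketched like this. The medal rows are invented for illustration and the function names are hypothetical.

```python
from collections import Counter

# Hypothetical medal rows: (country, year, season). One row = one medal entry.
MEDALS = [
    ("USA", 1984, "Summer"), ("USA", 1984, "Summer"), ("USA", 1996, "Summer"),
    ("USA", 2010, "Winter"), ("Norway", 2010, "Winter"),
]


def medals_per_year(medals, country, season):
    """Total medal entries per year for one country and one season."""
    totals = Counter()
    for row_country, year, row_season in medals:
        if row_country == country and row_season == season:
            totals[year] += 1
    return totals


def best_year(totals):
    """Return (year, medal_count) for the peak of the line, or None if empty."""
    if not totals:
        return None
    return max(totals.items(), key=lambda item: item[1])


usa_summer = medals_per_year(MEDALS, "USA", "Summer")
print(sorted(usa_summer.items()))
print(best_year(usa_summer))
```

The `country` and `season` arguments play the role of the region and season filters on the chart.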
The map above shows the number of gold medals won by each country as color intensity. Hovering over a region gives the split of gold medals between the Summer and Winter Olympics. The USA has the highest color saturation, which shows that it has won the most gold medals so far: 1,131 (Summer: 1,035, Winter: 96). Using year as a filter, we can also see which country won the most gold medals in a particular year.
The graphs above gave us some good insights into the Olympics dataset.
The questions above dig deeper into the dataset, moving from the total number of
participants per country, to the number of medals won by a country, and finally to the total number of
gold medals won by a country.
The dataset records medals won by each participant rather than by the team as a whole, so I had to group the data by team
to get the total number of medals won by a country. Getting the number of gold medals specifically also involved a conditional sum, which counts a row only when its medal is gold.
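The grouping and conditional sum described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the Tableau calculation itself: the rows are made up, and the field names (team, season, medal) are hypothetical.

```python
from collections import defaultdict

# Hypothetical per-participant rows; an empty medal means no medal was won.
ROWS = [
    {"team": "USA", "season": "Summer", "medal": "Gold"},
    {"team": "USA", "season": "Winter", "medal": "Gold"},
    {"team": "USA", "season": "Summer", "medal": "Silver"},
    {"team": "USA", "season": "Summer", "medal": ""},
    {"team": "Norway", "season": "Winter", "medal": "Gold"},
]


def medal_summary(rows):
    """Group participant rows by team and total their medals."""
    summary = defaultdict(
        lambda: {"total": 0, "gold": 0, "gold_summer": 0, "gold_winter": 0}
    )
    for row in rows:
        if not row["medal"]:
            continue  # skip participants without a medal
        team = summary[row["team"]]
        team["total"] += 1
        # Conditional sum: add 1 only when the medal is gold.
        if row["medal"] == "Gold":
            team["gold"] += 1
            team["gold_" + row["season"].lower()] += 1
    return dict(summary)


summary = medal_summary(ROWS)
for team, counts in summary.items():
    print(team, counts)
```

The per-season gold counts correspond to the Summer and Winter split shown when hovering over a region on the map.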
In this way, I used Tableau to design different charts that answer the questions above.
Some interesting facts about the dataset: we know that each edition of the Olympics is held every four years.
I wondered why some years are missing from the second graph (the line chart). Some googling led me to the fact that
the Olympics were cancelled in those years because of World War 1 and World War 2.
Example: the 1916 Summer Olympics were cancelled due to World War 1, and the 1940 and 1944 Games (both Summer and Winter) were cancelled due to World War 2.
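Gaps like these can also be found programmatically. The sketch below assumes a strict four-year cadence within one season and uses a hand-typed sample of Summer Games years rather than values read from the dataset; the function name is hypothetical.

```python
def missing_editions(years, step=4):
    """Return the years skipped between consecutive Games, assuming a fixed step."""
    ordered = sorted(set(years))
    missing = []
    for previous, following in zip(ordered, ordered[1:]):
        # Every multiple of `step` strictly between two held Games is a gap.
        missing.extend(range(previous + step, following, step))
    return missing


# Hand-typed sample of Summer Games years (not read from the dataset).
SUMMER_YEARS = [1908, 1912, 1920, 1924, 1928, 1932, 1936, 1948, 1952]

gaps = missing_editions(SUMMER_YEARS)
print(gaps)
```

Any year that breaks the four-year pattern would need separate handling, so the result is a starting point for checking rather than a definitive list.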