Exploratory Data Analysis: Covariation of a Categorical and a Continuous Variable

Which Wine Varieties are the Most Affordable? In my recent post about Variation, I used Kaggle’s Wine Reviews data set to explore the variation within wine variety, specifically to find the most common wines, with the most common being Chardonnay. I then looked at the variation of Chardonnay prices and ultimately found that the outliers in the data set may have been entered by error. A great example of why exploratory data analysis is needed for any project. Now, instead of looking at the variation in one variable, in this post I want to use the data to see the … Continue reading Exploratory Data Analysis: Covariation of a Categorical and a Continuous Variable

Exploratory Data Analysis: Covariation of Two Categorical Variables

Missing Migrants Project In the past few years, there has been a huge refugee crisis worldwide as migrants leave their home countries in hopes of better lives. They leave their homes for a variety of reasons from natural disasters, poverty, violence and war to being dissatisfied with the state of their country. Unfortunately, not all migrants make it to their destination as many go missing and die along the way. This led to the Missing Migrants Project which tracks the deaths of migrants, including refugees, who have gone missing along major migration routes worldwide. Data like this is important because … Continue reading Exploratory Data Analysis: Covariation of Two Categorical Variables

Exploratory Data Analysis: Covariation of Two Continuous Variables

Freedom and Happiness This week I came across the World Happiness Report, an annual survey which represents 156 of the world’s countries and how happy the citizens of these countries perceive themselves to be. The report calculates positive and negative emotion based on six key explanatory factors: social support, freedom, corruption, generosity, GDP and life expectancy. I want to explore the correlation between freedom, the freedom to make life choices, and positive affect, the measure of positive emotion. More specifically, I want to see which countries have the happiest citizens and which countries do not based on the freedom that … Continue reading Exploratory Data Analysis: Covariation of Two Continuous Variables

Exploratory Data Analysis: Variation

A Look at Variation in Wine Data If you read my last post on Exploratory Data Analysis, then you know that there are many ways to explore a data set. And if you haven’t read it yet, pause and Click Here to read it. This post covers how looking at the variation within a variable can reveal interesting information about the data that you are working with. Variation specifically looks at how the values of a variable changes from one measurement to another. Because of this, each variable will have its own unique pattern, and the only way to see … Continue reading Exploratory Data Analysis: Variation