Disney Movies Analysis Project

Principles of Computing - Project ID: 26.

Project Description

Disney movies have been a mainstay of the entertainment industry for the past half-century. Very few other movie studios have had the same prolonged success or cultural impact as Disney. So what makes Disney movies so much more successful than others? Is there a secret to Disney’s success, or is it the culmination of trial and error coupled with a bit of luck?

This data visualization project analyzes Disney movie performance from 1960 to 2016, investigating the categories of movie genre, MPAA rating, studio divisions, and relationships to the performance of the US economy, all in comparison to total gross income.


Exploration Questions - In trying to discern which factors contribute to the success of Disney movies, these questions guided our investigation.

Question 1: Which genre within Disney movies produces the highest income?
Question 2: How does the MPAA rating of a Disney movie affect its income?
Question 3: Is there any difference in the rating of Disney movies over time?
Question 4: Is there any difference in the rating of Disney movies based on genre?
Question 5: How does the national economy affect the income of Disney movies?
Question 6: How do Marvel or Pixar movies compare to other Disney movies in income?
Question 7: Does the month of release for a Disney movie affect its income?

Data Sources and Data Journey

Data Sources - We started off this project with a CSV (Comma Separated Values) data source which had every Disney movie released up until 2016 along with their release date, genre, MPAA rating, total gross income, and inflation adjusted total gross income. This data set served as the catalyst for our project and guided our exploration as we tried to compare these aspects of Disney movies to see their success. From there we realized we wanted to compare the income of Disney movies to the US GDP, to examine the effect of the economy on Disney movies. To do this we found a JSON (JavaScript Object Notation) data source that had the GDP of the US each year from 1960 to the present day. Finally, we wanted to see if certain studios of Disney movies performed better than the main corpus of Disney works. This led us to find two additional CSV files with the names of Disney Pixar movies and Disney Marvel movies.

Data Journey - Many of the questions we wanted to explore revolved around the average inflation adjusted total gross income of Disney movies, whether it be by year, movie genre, MPAA rating, or studio. This meant that the majority of our data manipulation was separating the movies into these categories, calculating those averages, and creating data frames that organized that information for visualizations.

Data Caveats - There are a few caveats with our data sources. Primarily, the main Disney movies CSV we used were incomplete in the MPAA ratings of the movies, with many of them not rated. This may impact some of the insights we make based on MPAA ratings. Another caveat is that the Disney movies CSV only goes until 2016, meaning more recent movies are left out, and no impact from the COVID-19 pandemic is taken into account. A final caveat is that in order to make the GDP data and Disney movie data compatible, the starting point of our data was set at 1960, meaning eight early Disney movies released before that date are not included in our analysis.

Project Members

Daniel Stracensky - Daniel is a junior from Ohio. He is a History and Classics double major and wanted to learn about code and computing because he thinks it is a valuable skill and could help in examining historical data.

Alphonsus Koong Bok Hui - Alphonsus is a freshman from Singapore. He intends to major in Computer Science and Philosophy in order to harness technology to solve social, political, and economic problems in the world.