top of page
Search

Movie Popularity Based on the Cast

  • rgutkows
  • Oct 3, 2022
  • 1 min read

Updated: Jan 24, 2023

Introduction

I will be working with a movie database with over 45,000 different movies from Full MovieLens. When looking for movies to watch you normally will check the rating and the cast for any famous or favorite actors and actresses. I want to know how different famous cast members' movies do base on revenue and rating.


Database

This database was pulled from Kaggle. The database gives extensive information about different movies. I am looking to use the movies_metadata for movie names and revenue, credits for the list of the cast, and ratings_small for the movie's ratings.

https://www.kaggle.com/datasets/rounakbanik/the-movies-dataset


Pre-Processing

I first found how many null values there were in total for each field. I then dropped the null values for each field that I did not need. This helps to reduce the overall about of data allowing the data to be read faster.

ree
ree
ree





Visualization


ree
ree

Conclusion

I ran into an issue when trying to set up a visualization of cast members and the corresponding movie revenue. Once I am able to set that graph up properly, I believe that this information can help impact movie directors when deciding who to bring in for certain roles. They can see who has performed higher than others based on the rating and revenue generated. This can however hurt new actors and actresses coming into the industry as their portfolio is not built. Some directors could overlook them as possible candidates.


 
 
 

Recent Posts

See All
Ethical Issues in Classification

Classification raises many questionable dilemmas, especially with large companies that have access to the population's personal...

 
 
 

Comments


bottom of page