Movie Popularity Based on the Cast
- rgutkows
- Oct 3, 2022
- 1 min read
Updated: Jan 24, 2023
Introduction
I will be working with a movie database with over 45,000 different movies from Full MovieLens. When looking for movies to watch you normally will check the rating and the cast for any famous or favorite actors and actresses. I want to know how different famous cast members' movies do base on revenue and rating.
Database
This database was pulled from Kaggle. The database gives extensive information about different movies. I am looking to use the movies_metadata for movie names and revenue, credits for the list of the cast, and ratings_small for the movie's ratings.
https://www.kaggle.com/datasets/rounakbanik/the-movies-dataset
Pre-Processing
I first found how many null values there were in total for each field. I then dropped the null values for each field that I did not need. This helps to reduce the overall about of data allowing the data to be read faster.



Visualization


Conclusion
I ran into an issue when trying to set up a visualization of cast members and the corresponding movie revenue. Once I am able to set that graph up properly, I believe that this information can help impact movie directors when deciding who to bring in for certain roles. They can see who has performed higher than others based on the rating and revenue generated. This can however hurt new actors and actresses coming into the industry as their portfolio is not built. Some directors could overlook them as possible candidates.
Comments