Skip to content

The main objective of this analysis is to study various attributes such as title, genre, type (movie or TV show), release year, country, rating, duration, and date of addition on Netflix.

Notifications You must be signed in to change notification settings

Anandsi978/NETFLIX-MINOR-PROJECT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

3 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ“Š Netflix Data Analysis

๐Ÿ“ Project Overview

The Netflix Data Analysis project aims to explore, clean, and visualize Netflixโ€™s dataset to uncover key insights about its movies and TV shows. The analysis helps in understanding content trends, popular genres, distribution across countries, and the evolution of Netflixโ€™s library over time.

Through this project, we perform data cleaning, exploratory data analysis (EDA), and visualization using Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

๐ŸŽฏ Objectives

Analyze Netflixโ€™s catalog of movies and TV shows.

Identify trends in genres, ratings, and release years.

Compare the distribution of movies vs. TV shows.

Visualize country-wise and time-based content growth.

Gain hands-on experience in data preprocessing and visualization.

๐Ÿ—‚๏ธ Dataset

Source: Netflix Dataset on Kaggle

File: netflix_titles.csv

Key Columns:

show_id

type (Movie/TV Show)

title

director

cast

country

date_added

release_year

rating

duration

listed_in (Genre)

description

๐Ÿงฉ Technologies Used

Programming Language: Python

Libraries:

Pandas โ€“ Data manipulation and cleaning

NumPy โ€“ Numerical computation

Matplotlib & Seaborn โ€“ Data visualization

Jupyter Notebook โ€“ Interactive analysis environment **

โš™๏ธ Project Workflow

** Importing Libraries & Dataset

Load the Netflix dataset into a pandas DataFrame.

Data Cleaning

Handle missing values, remove duplicates, and standardize formats.

Exploratory Data Analysis (EDA)

Summary statistics, unique values, and type distributions.

Visualization

Graphical insights using bar plots, pie charts, heatmaps, and count plots.

Insights & Conclusions

Summarize findings and interpret key trends.

๐Ÿ“ˆ Key Insights

Movies make up the majority of Netflixโ€™s content.

The USA and India contribute the most titles to Netflixโ€™s catalog.

Drama and Comedy are the most common genres.

A significant increase in Netflix content can be seen after 2015.

The most common content rating is TV-MA (for mature audiences).

๐Ÿ’ก Conclusion

The analysis provides a clear picture of Netflixโ€™s content strategy and growth. It highlights how Netflix has expanded globally, diversified its genres, and increased its focus on original programming in recent years.

๐Ÿš€ How to Run the Project

Clone this repository:

git clone https://github.com/your-username/netflix-data-analysis.git

Navigate to the project folder:

cd netflix-data-analysis

Install required libraries:

pip install pandas numpy matplotlib seaborn

Open the Jupyter Notebook:

jupyter notebook netflix_analysis.ipynb

๐Ÿ‘ค Author

Anand Singh B.Tech โ€“ Cloud Computing & Machine Learning ๐Ÿ“ง Email: anandsi1726j@gmail.com

About

The main objective of this analysis is to study various attributes such as title, genre, type (movie or TV show), release year, country, rating, duration, and date of addition on Netflix.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published