Data analysis of Netflix content using R
Project Overview
This project involves data cleaning and visualization of a Kaggle data set containing movies and TV shows available on Netflix.
Data Cleaning
I made the following changes to clean the data set:
- Identified and replaced incorrect values in columns
- Used a regular expression to convert
date_added
from a character type to a date type
- Replaced ratings that have the same meaning with a common value, so they're represented in one, standard way
- Made new columns for
duration
based on type
Data Visualization
I created a few visualizations to summarize the data. Below are two of them.
![Untitled](https://s3-us-west-2.amazonaws.com/secure.notion-static.com/23e797e9-b66d-4222-8d02-98f0f0abca70/Untitled.png)
![Untitled](https://s3-us-west-2.amazonaws.com/secure.notion-static.com/acd1dcc0-03ad-4742-9293-0845d795bbe1/Untitled.png)
Full Project
https://mw-projects-netflix-r.netlify.app/