Pandas is the most useful data analysis package in Python. You can use it to clean-up, transform and analyze data. Recently, I had a chance to use Pandas for some work. So, in this video let me share a quick but fairly in-depth introduction to Pandas with a real-world case study.
๐โโ
SAMPLE FILES & INSTRUCTIONS:
==================================
~ i n s t a l l a t i o n โ ~
Download and install Pandas & other data analysis packages:
- from Anaconda https://www.anaconda.com/
- from Pandas website https://pandas.pydata.org/
- from VS Code https://code.visualstudio.com/docs/datascience/data-science-tutorial
~ s a m p l e f i l e ๐ ~
- Direct Download https://chandoo.org/wp/wp-content/uploads/2022/09/hotel-booking-data.txt
- On Kaggle https://www.kaggle.com/datasets/ronecone/hotel-booking-data
~ c o d e ๐จโ๐ป ~
- Jupyter Notebook https://files.chandoo.org/py/pandas-intro.ipynb
- Kaggle Notebook https://www.kaggle.com/ronecone/data-cleanup-simple-analysis
โฑ VIDEO CHAPTERS ๐
===================
0:00 - What is Pandas for Python?
0:55 - About the sample data & getting started
2:17 - Importing the libraries and loading data to data frame
4:08 - Fixing the data frame delimiter issue
5:21 - Using basic pandas functions (describe etc.)
7:15 - Fixing the text data issue by removing them
8:35 - Solving the problem correctly
11:20 - Adding a column with IF this then that type rule
13:38 - Filling the text value up (using fillna method)
16:13 - Removing the unnecessary rows
16:37 - Doing quick analysis of data (tables & graphs)
๐ LEARN MORE ABOUT PYTHON
============================
Watch my 1-hour Python Tutorial to understand and write your first programs - https://youtu.be/uoC48wrJ-yM
๐คทโโ๏ธ Other ways to clean this data
===========================
We can also use Power Query in Excel to cleanse and process this data. See this tutorial for that - https://youtu.be/bSaLlNLAW44
๐ LEARN MORE ABOUT PANDAS
============================
Data Science with Pandas - Tutorial on VSCODE page - https://code.visualstudio.com/docs/datascience/data-science-tutorial
Pandas documentation & examples - https://pandas.pydata.org/docs/getting_started/intro_tutorials/01_table_oriented.html
๐๐๐ Pandas books:
Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython by Wes McKinney - https://amzn.to/3QPTgCt
Pandas Cookbook by Daniel Chen - https://amzn.to/3ROiQbW
~
๐ผ What do pandas use to record their thoughts?
Notebooks of course ๐ ๐คฃ
#pandas #python #datascience