Handling Categorical Data in Machine Learning: Easy Explanation for Data Science Interviews
Handling categorical data in machine learning projects is a very common topic in data science interviews. In this video, Iโll cover the difference between tr...

Emma Ding
10.5K views โข Dec 19, 2022

About this video
Handling categorical data in machine learning projects is a very common topic in data science interviews. In this video, Iโll cover the difference between treating a variable as a dummy variable vs. a non-dummy variable, how you can deal with categorical features when the number of levels is very large, and the pros and cons of various strategies.
Feature hashing
https://en.wikipedia.org/wiki/Feature_hashing
๐ขGet all my free data science interview resources
https://www.emmading.com/resources
๐ก Product Case Interview Cheatsheet https://www.emmading.com/product-case-cheat-sheet
๐ Statistics Interview Cheatsheet https://www.emmading.com/statistics-interview-cheat-sheet
๐ฃ Behavioral Interview Cheatsheet https://www.emmading.com/behavioral-interview-cheat-sheet
๐ต Data Science Resume Checklist https://www.emmading.com/data-science-resume-checklist
โ We work with Experienced Data Scientists to help them land their next dream jobs. Apply now: https://www.emmading.com/coaching
// Comment
Got any questions? Something to add?
Write a comment below to chat.
// Let's connect on LinkedIn:
https://www.linkedin.com/in/emmading001/
====================
Contents of this video:
====================
00:00 Introduction
00:48 Categorical Data
02:22 Ordinal Features & Class Labels
03:38 One-Hot Encoding
05:32 Dummy Encoding
06:30 Problems of One-Hot & Dummy Encoding
07:26 Feature Hashing
Feature hashing
https://en.wikipedia.org/wiki/Feature_hashing
๐ขGet all my free data science interview resources
https://www.emmading.com/resources
๐ก Product Case Interview Cheatsheet https://www.emmading.com/product-case-cheat-sheet
๐ Statistics Interview Cheatsheet https://www.emmading.com/statistics-interview-cheat-sheet
๐ฃ Behavioral Interview Cheatsheet https://www.emmading.com/behavioral-interview-cheat-sheet
๐ต Data Science Resume Checklist https://www.emmading.com/data-science-resume-checklist
โ We work with Experienced Data Scientists to help them land their next dream jobs. Apply now: https://www.emmading.com/coaching
// Comment
Got any questions? Something to add?
Write a comment below to chat.
// Let's connect on LinkedIn:
https://www.linkedin.com/in/emmading001/
====================
Contents of this video:
====================
00:00 Introduction
00:48 Categorical Data
02:22 Ordinal Features & Class Labels
03:38 One-Hot Encoding
05:32 Dummy Encoding
06:30 Problems of One-Hot & Dummy Encoding
07:26 Feature Hashing
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
10.5K
Likes
246
Duration
9:39
Published
Dec 19, 2022
User Reviews
4.6
(2) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.
No specific trending topics match this video yet.
Explore All Trends