Rasa Whiteboard: NER for PII Detection Challenges

Exploring the challenges of using NER algorithms in Rasa to identify personally identifiable information in text. 📝

Rasa Whiteboard: NER for PII Detection Challenges
Rasa
2.2K views • Jan 4, 2022
Rasa Whiteboard: NER for PII Detection Challenges

About this video

Detecting personally identifiable information from text is typically a NER problem. It's an important problem, especially when you're thinking about labeling pipelines, but it's also very hard to get right. The goal of this video is to explain why it's such a hard problem and we'll poke at some systems to see how they might fail.

If you'd like to learn more about Presidio, you can find the project on GitHub here:
https://github.com/microsoft/presidio

You can find the live demo used in this video here:
https://presidio-demo.azurewebsites.net/

You can find more information about the equity evaluation corpus here:
https://saifmohammad.com/WebPages/Biases-SA.html

You can play with the `ner-english` Flair model on huggingface here:
https://huggingface.co/flair/ner-english

Video Information

Views

2.2K

Likes

45

Duration

14:20

Published

Jan 4, 2022

User Reviews

4.5
(2)
Rate:

Related Trending Topics

LIVE TRENDS

Related trending topics. Click any trend to explore more videos.