Easy Data Augmentation Techniques for Text Classification

This video introduces effective baseline methods for data augmentation in NLP and text classification, including synonym replacement and random insertion/deletion.

Connor Shorten5.9K views15:32

🔥 Related Trending Topics

LIVE TRENDS

This video may be related to current global trending topics. Click any trend to explore more videos about what's hot right now!

THIS VIDEO IS TRENDING!

This video is currently trending in Thailand under the topic 'สภาพอากาศ'.

About this video

This video explains a great baseline for exploring data augmentation in NLP and text classification particularly. Synonym replacement, random insertion/deletion/swapping can all be quickly implemented and don't require much overhead (compared to say back-translation or generative modeling). If you put this to use, please share your experience! Thanks for watching! Please Subscribe! Paper Links Easy Data Augmentation: https://arxiv.org/abs/1901.11196 Conditional BERT: https://arxiv.org/abs/1812.06705 RandAugment: https://arxiv.org/abs/1909.13719 CheckList: https://arxiv.org/abs/2005.04118 Chapters 0:00 Beginning 1:08 4 Operations Explored 3:13 Results 4:55 Hyperparameters of Augmentation 6:10 Ablations on Isolated Augmentations 8:53 t-SNE viz of Label Preservation 9:45 Contrast with Conditional BERT 10:41 Issue with Test Set Evaluation of Data Aug 12:02 Connection with RandAugment 13:44 Quick Takeaways from EDA

Video Information

Views
5.9K

Total views since publication

Likes
144

User likes and reactions

Duration
15:32

Video length

Published
Aug 11, 2020

Release date

Quality
hd

Video definition