This course will introduce the learner to text mining and text manipulation basics. The course begins with an understanding of how text is handled by python, the structure of text both to the machine and to humans, and an overview of the nltk framework for manipulating text. The second week focuses on common manipulation needs, including regular expressions (searching for text), cleaning text, and preparing text for use by machine learning processes. The third week will apply basic natural language processing methods to text, and demonstrate how text classification is accomplished. The final week will explore more advanced methods for detecting the topics in documents and grouping them by similarity (topic modelling). This course should be taken after: Introduction to Data Science in Python, Applied Plotting, Charting & Data Representation in Python, and Applied Machine Learning in Python.
Created by: University of Michigan
Taught by: V. G. Vinod Vydiswaran, Assistant Professor
The course is six modules long and is designed to be completed in six weeks.
Module 1:
Working with Text in Python
Module 2:
Basic Natural Language Processing
Module 3:
Classification of Text
Module 4:
Topic Modeling
It’s a tough reality: every year, over 14.1 million workers suffer from work-related injuries. For…
If you’ve ever wanted to learn how to cook, but didn’t know where to start,…
Choosing the right career path can be a daunting task, especially with the myriad of…
Believe it or not, the concept of human resources has existed for more than 100…
Web3 managed to change the gaming industry by leveraging blockchain technology. It offers a decentralized…
College is often fun and is filled with lots of activities, especially in the first…