(eCornell) Classifying Documents With Supervised Machine Learning
About This Course
In this course, you will start to use machine learning methods to further your exploration of document term matrices (DTM). You will use a DTM to create train and test sets with the scikit-learn package in Python � an important first step in categorizing different documents. You will also examine different models, determining how to select the most appropriate model for your particular natural language processing task. Finally, after you have chosen a model, trained it, and tested it, you will work with several evaluation metrics to measure how well your model performed. The technical skills and evaluation processes you study in the course will provide valuable experience for the workplace and beyond. The course is provided by eCornell in partnership with Genashtim.
What You'll Learn
Train classification models to categorize documents
Evaluate the model on the test set to measure how well it generalizes