(eCornell) Clustering Documents With Unsupervised Machine Learning

Training Provider: GENASHTIM
Course Reference: TGS-2023038321
S$1,000

About This Course

In this course, you will focus on measuring distance � the dissimilarity of various documents. The goal is to discover how alike or unlike various groups of text documents are to one another. At scale, this is a problem you might encounter if you need to group thousands of products together purely by using their product description or if you would like to recommend a movie to someone based on whether they liked a different movie. You will work with several different data sets and use both hierarchical and k-means clustering to create clusters, and you will practice with several distance measures to analyze document similarity. Finally, you will create visualizations that help to convey similarity in powerful ways so stakeholders can easily understand the key takeaways of any clustering or distance measure that you create. The course is provided by eCornell in partnership with Genashtim.

What You'll Learn

Analyze term and document similarity using various distance measures
Use and evaluate hierarchical clustering to group similar documents
Use and evaluate k-means clustering to group similar documents and measure quality

Course Details

Duration 15 hours
Language English
Training Commitment Part Time
Total Enrolled New course
Back to All Courses
Note: To apply for this course, visit the SkillsFuture website or contact the training provider directly.

More Courses from GENASHTIM

The framework used in this course for solving fluid dynamics problems can be applied to a wide array...
Duration 12 hours
Fee After Subsidy S$700
While 2D simulations are a good place to begin, many of the real-world applications of simulation re...
Duration 12 hours
Fee After Subsidy S$700
In this course, you will analyze the processes and theories you must consider as you begin to explor...
Duration 12 hours
Fee After Subsidy S$700