SemanticScuttle - klotz.me » klotz: k-means

klotz: k-means*

“Understanding K-means Clustering in Machine Learning”

2018-09-13 Tags: k-means, clustering, python, machine learning by klotz

ow can you learn about the underlying structure of documents in a way that is informative and intuitive? This basic motivating question led me on a journey to visualize and cluster documents in a two-dimensional space. What you see above is an output of an analytical pipeline that begin by gathering synopses on the top 100 films of all time and ended by analyzing the latent topics within each document. In between I ran significant manipulations on these synopses (tokenization, stemming), transformed them into a vector space model (tf-idf), and clustered them into groups (k-means). You can learn all about how I did this with my detailed guide to Document Clustering with Python. But first, what did I learn?

2016-06-02 Tags: lda, nlp, clustering, k-means, cosine similarity, imdb, movies, tf-idf by klotz

Document Clustering with Python

tokenizing and stemming each synopsis
transforming the corpus into vector space using tf-idf
calculating cosine distance between each document as a measure of similarity
clustering the documents using the k-means algorithm
using multidimensional scaling to reduce dimensionality within the corpus
plotting the clustering output using matplotlib and mpld3
conducting a hierarchical clustering on the corpus using Ward clustering
plotting a Ward dendrogram
topic modeling using Latent Dirichlet Allocation (LDA)

2018-08-16 Tags: lda, document, clustering, python, tf-idf, k-means, nlp, text by klotz

How-To: OpenCV and Python K-Means Color Clustering

2014-05-30 Tags: k-means, python, color, machine learning, opencv by klotz

Visualizing K-Means Clustering

2014-01-25 Tags: k-means, clustering, visualization by klotz

SOM Toolbox

2012-11-28 Tags: hut, k-means, machine learning, som, visualization by klotz

Machine Learning: k-Means Clustering in Javascript Part 1 | Burak Kanber's Blog

2012-10-16 Tags: javascript, k-means, machine learning by klotz

The Glowing Python: K- means clustering with scipy

2012-07-09 Tags: k-means, machine learning, python, scipy by klotz

The Glowing Python: Color quantization