SemanticScuttle - klotz.me » Tags: text+python

Classifying 4M Reddit Posts in 4k Subreddits: an End-to-end Machine Learning Pipeline This bookmark is certified by an admin user.

2020-04-05 Tags: nlp, classification, reddit, fasttext, python, deep learning by klotz

Comparison of Top 6 Python NLP Libraries This bookmark is certified by an admin user.

2018-07-31 Tags: python, nlp, review by klotz

datalib/libextract · GitHub This bookmark is certified by an admin user.

2016-03-15 Tags: text, extraction, python, github by klotz

dhammack/Word2VecExample This bookmark is certified by an admin user.

Word list odd man out

2013-08-19 Tags: nlp, python by klotz

DIY-Data-Science/gensim.md at master · jxieeducation/DIY-Data-Science · GitHub This bookmark is certified by an admin user.

2016-05-03 Tags: word2vec, howto, gensim, python, nlp by klotz

Document Clustering with Python This bookmark is certified by an admin user.

tokenizing and stemming each synopsis
transforming the corpus into vector space using tf-idf
calculating cosine distance between each document as a measure of similarity
clustering the documents using the k-means algorithm
using multidimensional scaling to reduce dimensionality within the corpus
plotting the clustering output using matplotlib and mpld3
conducting a hierarchical clustering on the corpus using Ward clustering
plotting a Ward dendrogram
topic modeling using Latent Dirichlet Allocation (LDA)

2018-08-16 Tags: lda, document, clustering, python, tf-idf, k-means, nlp, text by klotz

emailinsight/mboxConvert.py at master · andreykurenkov/emailinsight · GitHub This bookmark is certified by an admin user.

2019-06-22 Tags: mail, parsing, text, nlp, machine learning, python, data, sanitization by klotz

Everything You Can Do With Python’s Textwrap Module This bookmark is certified by an admin user.

The TextWrapper class provides functionality for wrapping long pieces of text into multiple shorter lines while preserving the initial and subsequent indents.

2024-02-07 Tags: python, text processing, text, wrapping by klotz

explacy/readme.md at master · tylerneylon/explacy This bookmark is certified by an admin user.

2019-05-01 Tags: spacy, nlp, python, github by klotz

Exploratory Data Analysis, Categorical Data — Part II This bookmark is certified by an admin user.

2019-12-04 Tags: nlp, categorical data, python, sentiment analysis, tfidf by klotz

SemanticScuttle - klotz.me

Tags: text* + python*

Linked Tags

Related Tags