SemanticScuttle - klotz.me » Tags: python+similarity

Tags: python* + similarity*

0 bookmark(s) - Sort by: Date ↓ / Title /

Rensa: A novel high-performance MinHash Implementation in Rust

Rensa is a high-performance MinHash suite written in Rust with Python bindings. It's designed for efficient similarity estimation and deduplication of large datasets. It offers R-MinHash, C-MinHash, and OptDensMinHash variants, significantly faster than datasketch while maintaining comparable accuracy.

2025-06-02 Tags: minhash, document, similarity, deduplication, estimation, rust, python, lsh, datasketch, github, beowolx, text machine learning by klotz
GitHub - tensorflow/similarity: TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

2021-09-14 Tags: tensorflow, similarity, github by klotz
PolyFuzz: String matching, grouping, and evaluation. | Towards Data Science

2020-12-08 Tags: string, matching, machine learning, polyfuzz, python, transformer, levenstein distance, text analysis, similarity, tf-idf, rapidfuzz, fasttext by klotz
duplicate-code-detection-tool/duplicate_code_detection.py at master · platisd/duplicate-code-detection-tool · GitHub

A simple Python3 tool to detect similarities between files within a repository.
Document similarity code adapted from Jonathan Mugan's tutorial:
https://www.oreilly.com/learning/how-do-i-compare-document-similarity-using-python
'''

2020-03-11 Tags: python, code, similarity, tf-idf, document by klotz
word2vec-pride-vis/tsne.py at master · arnicas/word2vec-pride-vis

I used a Python t-SNE library to reduce the 200 feature dimensions for each word to 2 dimensions and plotted them in matplotlib. I saved out the x/y coordinates for each word in the book, so that I can show those words on the graph as you mouse over the replaced (blue) words.

2016-05-19 Tags: t-sne, similarity, dimensionality reduction, pca, python, word2vec, matplotlib by klotz
A Python Implementation Of Simhash Algorithm - Next Spaceship

2014-04-03 Tags: simhash, similarity, python by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0

About - Propulsed by SemanticScuttle