The Wayback Machine - https://web.archive.org/web/20210918125738/https://github.com/topics/minhash-lsh-algorithm
Here are
33 public repositories
matching this topic...
End-to-end earthquake detection pipeline via efficient time series similarity search
Updated
Apr 21, 2021
Shell
A Clojure library for querying large data-sets on similarity
Updated
Feb 17, 2019
Clojure
SetSketch: Filling the Gap between MinHash and HyperLogLog
There are Python 2.7 codes and learning notes for Spark 2.1.1
Updated
Aug 21, 2018
Python
A text similarity computation using minhashing and Jaccard distance on reuters dataset
MinHash and LSH index written in Rust for Node.js
Updated
Sep 13, 2021
Rust
insight data engineering fellow project
Updated
Nov 14, 2016
Python
A simple audio fingerprinting system
An improved method of locality-sensitive hashing for scalable instance matching. In this study, we propose a scalable approach for automatically identifying similar candidate instance pairs in very large datasets utilizing minhash-lsh-algorithm in C#.
An easy-to-use script for fast similarity search in the textual data (and embedding space) with GPU & Multi-core support.
Updated
Aug 26, 2019
Python
Fast Jaccard similarity search for abstract sets (documents, products, users, etc.) using MinHashing and Locality Sensitve Hashing
Updated
May 21, 2020
Python
Minhash clustering of text documents
Updated
Sep 29, 2017
Scala
📃 Document similarity detection using hashing
Scalable Data Mining - Assignment submissions
Updated
Dec 11, 2017
Python
Implementation of a B+ Tree for range and exact match queries and of the LSH algorithm for finding similar documents as measured by Jaccard Similarity.
Updated
Feb 19, 2021
Python
documents my master's level thesis work on building continous, topical web crawler based on mercator 1999
A set of methods and model evaluation metrics for predicting links in an academic citation network using Apache Spark and Scala
Updated
Nov 3, 2020
Scala
Finding Similar Pairs using PySpark
Updated
Jan 10, 2021
Jupyter Notebook
Updated
Sep 17, 2016
Java
Project 1: Similar document searching via MinHash and Locality Sensitive Hashing
Updated
Dec 10, 2018
Jupyter Notebook
Updated
Mar 12, 2018
Jupyter Notebook
Updated
Apr 14, 2018
Jupyter Notebook
Recommendation systems for Yelp (collaborative filtering & content-based)
Updated
Mar 28, 2020
Python
An implementation of the MinHashing algorithm in C using POSIX threads.
Probability Methods for Informatics Engineering | UA 2018/2019
Updated
Jan 22, 2020
Java
similarity of the texts (Jaccard Similarity, Minhash, LSH)
Updated
Feb 21, 2021
Python
Updated
Mar 25, 2019
XSLT
SpellChecker: an application to check for spell errors.
Updated
Apr 13, 2021
Java
Minhash text analyzer developed during Algorithmics subject.
Updated
Apr 12, 2018
Python
Improve this page
Add a description, image, and links to the
minhash-lsh-algorithm
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
minhash-lsh-algorithm
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.