Java implementation for MinHash and LSH for finding near duplicate documents as measured by Jaccard similarity.
☆32Mar 30, 2015Updated 10 years ago
Alternatives and similar repositories for MinHashLSH
Users that are interested in MinHashLSH are comparing it to the libraries listed below
Sorting:
- A Java implementation of Locality Sensitive Hashing (LSH)☆301Nov 19, 2022Updated 3 years ago
- Easy-to-use Java library for similarity checking of strings or numeric-series☆20Jan 23, 2020Updated 6 years ago
- Implementation of the Apriori algorithm using Spark.☆38Nov 9, 2014Updated 11 years ago
- ☆10May 16, 2022Updated 3 years ago
- Parallel programs with OpenMPI☆10Apr 1, 2015Updated 10 years ago
- Reusable shiny modules☆12Jan 29, 2016Updated 10 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- Companion source code for GTC 2014 talk☆11Mar 25, 2014Updated 11 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- Footstep planning and Trajectory Optimization☆10Apr 12, 2015Updated 10 years ago
- 抓取国家统计局数据☆13May 4, 2016Updated 9 years ago
- Symbolic range analysis for LLVM.☆12Jan 10, 2016Updated 10 years ago
- Doing research on top of Jalangi☆12Sep 9, 2016Updated 9 years ago
- ☆10Apr 15, 2023Updated 2 years ago
- A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a role☆10Jan 9, 2026Updated last month
- a list of links to help you make various important architectural decisions☆11Jul 13, 2016Updated 9 years ago
- A tensorflow implementation of a series of deep learning methods to predict CTR, including FM, FNN, NFM, Attention-based NFM, Attention-b…☆11Jul 29, 2019Updated 6 years ago
- Training a YOLO NAS Model for detecting retail product items from shelf images using SKU110K dataset.☆10Aug 13, 2023Updated 2 years ago
- 人人好友关系网络☆35Feb 1, 2016Updated 10 years ago
- Toy implementation of SLIM and SSLIM Recommendation methods.☆42May 23, 2018Updated 7 years ago
- ☆11Nov 14, 2020Updated 5 years ago
- my own R course☆11Oct 14, 2014Updated 11 years ago
- Code for the CIKM 2013 paper "Discovering Coherent Topics Using General Knowledge"☆11Jul 14, 2014Updated 11 years ago
- hexapod simulator☆11May 24, 2015Updated 10 years ago
- A key/value database based on SkimpyStash.☆13Jun 11, 2015Updated 10 years ago
- Unsupervised Lifelong Person Re-identification via Contrastive Rehearsal☆11Apr 7, 2022Updated 3 years ago
- Multithreaded HTTP Download Accelerator☆23Jul 27, 2014Updated 11 years ago
- ☆12Aug 6, 2023Updated 2 years ago
- My fork of zerofrog's fast SIFT C++ reimplementation of Bill Lowe's original smash-hit image-analysis algorithm.☆21Sep 19, 2012Updated 13 years ago
- Clustering documents based on LSH☆14Apr 20, 2016Updated 9 years ago
- a c89 compiler, need total test.☆29Jan 20, 2018Updated 8 years ago
- PyData NYC 2015 tutorial examples☆12Feb 15, 2016Updated 10 years ago
- Algorithms from the book "Elements of Statistical Learning", implemented in Python☆12Mar 29, 2015Updated 10 years ago
- A workshop on how to select your data for annotation☆12Jul 20, 2021Updated 4 years ago
- simple arbitrage☆13Jul 29, 2010Updated 15 years ago
- Go Share your TimeSeries/NameSpace/KeyVal DataStore (using leveldb) over HTTP &/or ZeroMQ☆62Oct 28, 2015Updated 10 years ago
- MineGate, written in go.☆10Apr 3, 2015Updated 10 years ago
- a minimal responsive octopress theme☆19Jun 16, 2017Updated 8 years ago
- Distributed session storage system for Jetty.☆21May 13, 2012Updated 13 years ago