Java implementation of MinHash and LSH for finding near-duplicate documents, as measured by Jaccard similarity.
☆33 · Mar 30, 2015 · Updated 11 years ago
Alternatives and similar repositories for MinHashLSH
Users that are interested in MinHashLSH are comparing it to the libraries listed below.
- Natural language processing algorithms, including text classification, sentiment analysis, TextRank, LDA, and so on ☆12 · Mar 23, 2017 · Updated 9 years ago
- ☆12 · Sep 14, 2021 · Updated 4 years ago
- A LaTeX package cocktail for grad school level writing/presentation ☆13 · Feb 11, 2021 · Updated 5 years ago
- ☆11 · May 16, 2022 · Updated 3 years ago
- ☆11 · May 25, 2023 · Updated 2 years ago
- A React component to implement continuous scrolling (for modern browsers). ☆17 · Jan 12, 2017 · Updated 9 years ago
- The FER+ new label annotations for the Emotion FER dataset. ☆16 · Mar 9, 2018 · Updated 8 years ago
- The official implementation of EMNLP 2021 paper "#HowYouTagTweets: Learning User Hashtagging Preferences via Personalized Topic Attention… ☆11 · Feb 21, 2023 · Updated 3 years ago
- Java implementations of the top ten data mining algorithms. ☆23 · Sep 18, 2018 · Updated 7 years ago
- Three ways to compute TF-IDF: Python, sklearn, gensim ☆11 · Feb 26, 2019 · Updated 7 years ago
- Bitwise analysis tools ☆16 · Feb 5, 2019 · Updated 7 years ago
- An old and super slow Python implementation of HMM trigram POS tagger. ☆17 · Mar 23, 2014 · Updated 12 years ago
- A TensorFlow implementation of a series of deep learning methods to predict CTR, including FM, FNN, NFM, Attention-based NFM, Attention-b… ☆11 · Jul 29, 2019 · Updated 6 years ago
- A dictionary-based scoring algorithm for negative public-opinion information. ☆26 · Dec 16, 2014 · Updated 11 years ago
- Article from Medium about Push Notifications in Android ☆14 · Dec 27, 2015 · Updated 10 years ago
- CFG syntactic parsing for natural language processing ☆10 · Mar 27, 2018 · Updated 8 years ago
- ALBERT Text Classification Tensorflow, Resume Classification ☆15 · Mar 28, 2020 · Updated 6 years ago
- Implementation of the Apriori algorithm using Spark. ☆38 · Nov 9, 2014 · Updated 11 years ago
- A custom AWS credential provider that allows your Hadoop or Spark application to access the S3 file system by assuming a role ☆10 · Jan 9, 2026 · Updated 4 months ago
- A Locality-Sensitive Hashing Library for Scala with optional Redis storage. ☆16 · Jan 5, 2022 · Updated 4 years ago
- Generate sentence vectors with BERT in one line of code; BERT for text classification and text similarity computation ☆10 · Jul 1, 2019 · Updated 6 years ago
- Google's Natural Language Processing model with SOTA results in various tasks ☆16 · Jun 12, 2023 · Updated 2 years ago
- ☆16 · Nov 17, 2023 · Updated 2 years ago
- Apache Solr Client for Scala/Java ☆51 · Jan 11, 2016 · Updated 10 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions" ☆16 · Nov 5, 2024 · Updated last year
- a column file format ☆134 · Sep 25, 2012 · Updated 13 years ago
- pairwise learning to rank with logistic regression ☆19 · Apr 24, 2016 · Updated 10 years ago
- Tensorflow 1.8 with CUDA on macOS High Sierra 10.13.6 ☆20 · Aug 27, 2018 · Updated 7 years ago
- ☆10 · Apr 16, 2022 · Updated 4 years ago
- Using NLP techniques to summarize prompts for program synthesis ☆17 · Sep 26, 2023 · Updated 2 years ago
- Open Source Implementation of Simhash in Python ☆24 · Sep 14, 2017 · Updated 8 years ago
- code for the paper "Personalized Context-Aware Re-ranking for E-commerce Recommendation Systems" ☆51 · Jan 23, 2019 · Updated 7 years ago
- Python library implementing recommender systems algorithms with http://tensorflow.org ☆12 · Dec 21, 2018 · Updated 7 years ago
- In-memory Neo4j server for testing using the ImpermanentGraphDatabase ☆25 · Feb 14, 2017 · Updated 9 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs ☆14 · Apr 15, 2015 · Updated 11 years ago
- Example code for compiler and language implementation patterns ☆11 · Nov 22, 2014 · Updated 11 years ago
- Mirror kept for legacy. Moved to https://github.com/llvm/llvm-project ☆17 · Dec 14, 2016 · Updated 9 years ago
- Distantly Supervised Biomedical Named Entity Recognition with Dictionary Expansion: https://ieeexplore.ieee.org/document/8983212 ☆13 · Jun 4, 2020 · Updated 5 years ago
- ☆55 · Jul 2, 2017 · Updated 8 years ago