weblyzard / nilsimsa
A Java library for computing and comparing Nilsimsa string similarity hashes.
☆10Updated 2 years ago
Alternatives and similar repositories for nilsimsa:
Users that are interested in nilsimsa are comparing it to the libraries listed below
- This module contains an implementation of the Nilsimsa locality-sensitive hashing algorithm in Java.☆18Updated 5 years ago
- Lucene Auto Phrase TokenFilter implementation☆59Updated 6 years ago
- Dice Solr Plugins from Simon Hughes Dice.com☆87Updated 3 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 9 years ago
- command line tool for Apache Lucene☆161Updated 3 weeks ago
- ElasticSearch Prediction Generator and Plugin☆22Updated 9 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- Elasticsearch plugin for b-bit minhash algorism☆62Updated 9 months ago
- Topic Modeling with LDA in Scala and Spark☆31Updated 6 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆28Updated 6 years ago
- (deprecated) Please use new nlp4l instead.☆66Updated 8 years ago
- Mahout vector encoding for pig☆54Updated 2 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 5 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 12 years ago
- Search a single field with different query time analyzers in Solr☆25Updated 5 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- Storm Cassandra Integration☆179Updated last year
- Graph Processing Algorithms on top of Neo4j☆39Updated 7 years ago
- A Stanford CoreNLP server, with example clients, using Apache Thrift.☆47Updated 6 years ago
- Sample migration from Titan 0.5.4 to Titan 1.0.0☆17Updated 9 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 7 years ago
- Document clustering based on Latent Semantic Analysis☆96Updated 14 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated last year
- Keyword extraction package for Spark.☆12Updated 8 years ago
- How to spot first stories on Twitter using Storm.☆125Updated last year