guokr / simbaseLinks
A vector similarity database
☆231Updated 10 years ago
Alternatives and similar repositories for simbase
Users that are interested in simbase are comparing it to the libraries listed below
Sorting:
- ☆37Updated 6 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆56Updated 8 years ago
- Elasticsearch plugin for b-bit minhash algorism☆63Updated 11 months ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20Updated 9 years ago
- Plugin to integrate approximate nearest neighbor(ANN) search with Elasticsearch☆66Updated 6 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 3 years ago
- Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc.☆27Updated 11 years ago
- This tool extracts word vectors from Lucene index.☆135Updated 7 years ago
- Flowmix is a flexible event processing engine for Apache Storm. It supports complex correlations of events via sliding/tumbling windows. …☆58Updated 9 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 4 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Vector search in Lucene based search attempting to use just the existing Lucene data structures (experimental)☆43Updated 5 years ago
- An example how to implement a custom similarity (overlap similarity) for elasticsearch☆41Updated 9 years ago
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 7 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- A simple scoring plugin for vector in Elasticsearch.☆69Updated 8 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆60Updated 7 years ago
- Dockerfiles for fastText☆68Updated 3 years ago
- GraphPipe helpers for TensorFlow☆22Updated 6 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- Score documents with pure dot product / cosine similarity with ES☆251Updated 3 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆103Updated 2 weeks ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- Spark algorithms for building k-nn graphs☆42Updated 6 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 10 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 10 years ago
- The Kiji project suite☆33Updated 9 years ago
- Java library for authoring PMML☆16Updated 2 months ago
- Scripts and codes for replicating experiments published in Exploring Topic Coherence over many models and many topics☆82Updated 2 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago