kzn / fsa
Deterministic Acyclic Finite State Automaton implementation for morphological analysis
☆18Updated 4 years ago
Alternatives and similar repositories for fsa:
Users that are interested in fsa are comparing it to the libraries listed below
- This is a minimal acyclic finite-state automata algorithm in Java based on the paper, "Incremental Construction of Minimal Acyclic Finite…☆19Updated 11 years ago
- A framework for building reranking models.☆28Updated 9 years ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees.☆18Updated 8 years ago
- ☆15Updated 7 years ago
- Implementation of QuadSketch algorithm☆11Updated 2 years ago
- Suite of universal indexes for Highly Repetitive Document Collections☆20Updated 4 years ago
- SALM: Suffix Array and its Applications in Empirical Language Processing by Joy☆11Updated 7 years ago
- Code used for the experiments in the paper "Partitioned Elias-Fano Indexes"☆40Updated 10 years ago
- JSuffixArrays (Suffix Arrays in Java)☆59Updated 8 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 3 years ago
- SUccinct Retrieval Framework☆20Updated 9 years ago
- Experiments on bit-slice indexing☆13Updated 10 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Updated 9 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated last year
- A flexible tree-based index structure to support edit distance search on strings☆11Updated 7 years ago
- This is a fork of optimization part of RISO project (http://riso.sourceforge.net/)☆13Updated 9 years ago
- Experimental search engine in C/C++17 - still in early development.☆27Updated 6 months ago
- Fast implementations of the scancount algorithm: C++ header-only library☆26Updated 5 years ago
- Feed-forward Bloom filters☆52Updated 13 years ago
- Anytime Ranking for Impact-Ordered Indexes☆13Updated 8 years ago
- Implementation of the data structures described in the paper "Fast Compressed Tries using Path Decomposition".☆56Updated 2 years ago
- ☆26Updated 8 years ago
- A collection of generic, C++ Bloom Filter classes developed for the Boost C++ Libraries.☆23Updated 7 years ago
- Efficient and effective query auto-completion in C++.☆54Updated last year
- Java library for Concrete, a data serialization format for NLP☆6Updated 5 years ago
- Sketch Library for vector-based models☆14Updated 2 weeks ago
- ByteBuffer collection classes for java and jvm-based languages.☆33Updated 7 years ago
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification.☆34Updated last year
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago