A C++ library providing fast language model queries in compressed space.
☆132Feb 25, 2023Updated 3 years ago
Alternatives and similar repositories for tongrams
Users that are interested in tongrams are comparing it to the libraries listed below
Sorting:
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆16May 18, 2023Updated 2 years ago
- Efficient Prefix-Sum data structures in C++.☆27Oct 1, 2023Updated 2 years ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi…☆14Nov 7, 2019Updated 6 years ago
- Rust library providing fast language model queries in compressed space☆25Oct 1, 2022Updated 3 years ago
- Space-Efficient, High-Performance Rank & Select Structures on Uncompressed Bit Sequences☆15Aug 7, 2018Updated 7 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆15Jun 28, 2022Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps.☆36Jan 8, 2023Updated 3 years ago
- C++ Library implementing Compressed String Dictionaries☆47Apr 25, 2022Updated 3 years ago
- shoco is a compressor for small text strings. [Not maintained].☆10Sep 4, 2019Updated 6 years ago
- Efficient and effective query auto-completion in C++.☆56Sep 24, 2023Updated 2 years ago
- Implementation of eBWT using Prefix-free parse (PFP)☆14Jul 14, 2025Updated 8 months ago
- Go implementation of SIMD-BP128 integer encoding and decoding☆31Apr 8, 2022Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- C++17 implementation of memory-efficient dynamic tries☆58Feb 15, 2022Updated 4 years ago
- A new lossless data compression algorithm☆12Nov 19, 2025Updated 4 months ago
- Suite of universal indexes for Highly Repetitive Document Collections☆24May 14, 2020Updated 5 years ago
- Prefix Filter: Practically and Theoretically Better Than Bloom.☆49Sep 12, 2022Updated 3 years ago
- Go implementation of libhydrogen - a lightweight, easy-to-use crypto library☆24Mar 23, 2017Updated 8 years ago
- C++ Implementation of Zip Trees☆14Nov 5, 2022Updated 3 years ago
- A collection of succinct data structures☆213Jan 3, 2024Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 8 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Compact Tree Representation☆16Mar 16, 2017Updated 9 years ago
- A flexible and efficient C++ implementation of the Binary Interpolative Coding algorithm.☆31Jan 8, 2023Updated 3 years ago
- Keyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.☆257Updated this week
- Radix sorting in Go☆10Feb 4, 2019Updated 7 years ago
- Optimal partitioning of Variable-Byte codes for better compression and fast decoding.☆17Nov 10, 2021Updated 4 years ago
- Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model☆24Oct 28, 2023Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Universe-sliced indexes in C++.☆18Jan 8, 2023Updated 3 years ago
- Search engine postings list with support for compresison☆11May 5, 2017Updated 8 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago