A C++ library providing fast language model queries in compressed space.
☆132Feb 25, 2023Updated 3 years ago
Alternatives and similar repositories for tongrams
Users that are interested in tongrams are comparing it to the libraries listed below
Sorting:
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17May 18, 2023Updated 2 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Rust library providing fast language model queries in compressed space☆25Oct 1, 2022Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Efficient Prefix-Sum data structures in C++.☆26Oct 1, 2023Updated 2 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆15Jun 28, 2022Updated 3 years ago
- Efficient and effective query auto-completion in C++.☆57Sep 24, 2023Updated 2 years ago
- Go implementation of SIMD-BP128 integer encoding and decoding☆31Apr 8, 2022Updated 3 years ago
- Go implementation of libhydrogen - a lightweight, easy-to-use crypto library☆24Mar 23, 2017Updated 8 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi…☆14Nov 7, 2019Updated 6 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- C++ Library implementing Compressed String Dictionaries☆46Apr 25, 2022Updated 3 years ago
- Space-Efficient, High-Performance Rank & Select Structures on Uncompressed Bit Sequences☆15Aug 7, 2018Updated 7 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Stupid crypto tricks☆18May 12, 2021Updated 4 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- C++17 implementation of memory-efficient dynamic tries☆58Feb 15, 2022Updated 4 years ago
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps.☆36Jan 8, 2023Updated 3 years ago
- ☆57Apr 18, 2023Updated 2 years ago
- Suite of universal indexes for Highly Repetitive Document Collections☆24May 14, 2020Updated 5 years ago
- Statically-typed localization messages.☆10Oct 11, 2020Updated 5 years ago
- Radix sorting in Go☆10Feb 4, 2019Updated 7 years ago
- Multistream CNN for Robust Acoustic Modeling☆40Jun 17, 2021Updated 4 years ago
- shoco is a compressor for small text strings. [Not maintained].☆10Sep 4, 2019Updated 6 years ago
- Implementation of eBWT using Prefix-free parse (PFP)☆14Jul 14, 2025Updated 7 months ago
- ☆11Nov 5, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 7 years ago
- Keyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance.☆257Updated this week
- ☆16Jun 13, 2022Updated 3 years ago
- Smart Language Model☆47Dec 21, 2022Updated 3 years ago
- Covering grammars for English and Russian text normalization☆61Sep 15, 2019Updated 6 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Aug 6, 2022Updated 3 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- similarity join and search algorithms for edit distance and jaccard☆19Dec 17, 2017Updated 8 years ago