A C++ library implementing fast language model estimation using the 1-Sort algorithm.
☆16 · May 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for tongrams_estimation
Users who are interested in tongrams_estimation compare it to the libraries listed below.
- A C++ library providing fast language model queries in compressed space. ☆132 · Feb 25, 2023 · Updated 3 years ago
- Rust library providing fast language model queries in compressed space ☆25 · Oct 1, 2022 · Updated 3 years ago
- Implementation of eBWT using Prefix-free parse (PFP) ☆14 · Jul 14, 2025 · Updated 8 months ago
- ⚙️🛠️ Essential C++ utilities. ☆14 · Mar 9, 2026 · Updated last week
- Universe-sliced indexes in C++. ☆18 · Jan 8, 2023 · Updated 3 years ago
- Command line parser for C++17. ☆29 · Dec 10, 2024 · Updated last year
- Efficient and effective query auto-completion in C++. ☆56 · Sep 24, 2023 · Updated 2 years ago
- A self-contained, header-only, implementation of memory-mapped files in C++ for fast integration into larger projects. ☆27 · Aug 22, 2024 · Updated last year
- A cheatsheet for most common Stringology tasks ☆14 · Apr 14, 2021 · Updated 4 years ago
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps. ☆36 · Jan 8, 2023 · Updated 3 years ago
- ☆28 · Jan 29, 2021 · Updated 5 years ago
- Efficient Prefix-Sum data structures in C++. ☆27 · Oct 1, 2023 · Updated 2 years ago
- A flexible and efficient C++ implementation of the Binary Interpolative Coding algorithm. ☆31 · Jan 8, 2023 · Updated 3 years ago
- Static Huffman coding ☆10 · Apr 3, 2017 · Updated 8 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in … ☆11 · Mar 29, 2021 · Updated 4 years ago
- C++17 implementation of memory-efficient dynamic tries ☆58 · Feb 15, 2022 · Updated 4 years ago
- paper notes on nlp/cv/rl/dl ☆14 · May 15, 2017 · Updated 8 years ago
- Optimal distance lower bound k-mer sampling. ☆12 · Jun 19, 2024 · Updated last year
- A Benchmark of Minimal Perfect Hash Function Algorithms. ☆38 · Dec 20, 2022 · Updated 3 years ago
- ☆15 · Mar 11, 2026 · Updated last week
- A fast, compact trigram library for Icelandic. ☆11 · Dec 12, 2025 · Updated 3 months ago
- A (Minimal) Perfect Hash Function based on irregular cuckoo hashing, retrieval, and overloading. ☆21 · Sep 30, 2025 · Updated 5 months ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi… ☆14 · Nov 7, 2019 · Updated 6 years ago
- 📖 🧬 SSHash is a compressed, associative, exact, and weighted dictionary for k-mers. ☆94 · Updated this week
- This is the official repo for the paper "A General Framework for Dynamic Succinct and Compressed Data Structures." ☆13 · Oct 26, 2016 · Updated 9 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024) ☆10 · May 31, 2024 · Updated last year
- BLEU Score in Rust ☆12 · Mar 1, 2026 · Updated 2 weeks ago
- ☆24 · Mar 13, 2020 · Updated 6 years ago
- Implementation of a Backpack Quotient Filter ☆13 · Jul 7, 2025 · Updated 8 months ago
- Prefix Filter: Practically and Theoretically Better Than Bloom. ☆49 · Sep 12, 2022 · Updated 3 years ago
- Various C programs I have created from the https://learncgames.com website ☆12 · Mar 5, 2021 · Updated 5 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness. ☆15 · Jun 28, 2022 · Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Feb 15, 2024 · Updated 2 years ago
- Minimum plain text representation of kmer sets ☆16 · Jan 30, 2025 · Updated last year
- Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner ☆29 · May 11, 2019 · Updated 6 years ago
- BurrMill core ☆22 · Nov 2, 2021 · Updated 4 years ago
- FMSI is a highly memory efficient exact k-mer set index based on masked superstrings and the masked Burrows-Wheeler transform ☆24 · Nov 20, 2025 · Updated 4 months ago
- String Distances in rust ☆14 · Nov 21, 2022 · Updated 3 years ago
- Rust implementations of succinct data structures ☆69 · Updated this week