jermp/tongrams_estimation

A C++ library implementing fast language model estimation using the 1-Sort algorithm.

☆16

Alternatives and similar repositories for tongrams_estimation

Users who are interested in tongrams_estimation are comparing it to the libraries listed below.

jermp / tongrams
A C++ library providing fast language model queries in compressed space.
☆132 · Feb 25, 2023 · Updated 3 years ago
kampersanda / tongrams-rs
Rust library providing fast language model queries in compressed space
☆25 · Oct 1, 2022 · Updated 3 years ago
davidecenzato / PFP-eBWT
Implementation of eBWT using Prefix-free parse (PFP)
☆14 · Jul 14, 2025 · Updated last year
jermp / essentials
⚙️🛠️ Essential C++ utilities.
☆14 · Mar 9, 2026 · Updated 4 months ago
jermp / autocomplete
Efficient and effective query auto-completion in C++.
☆56 · Sep 24, 2023 · Updated 2 years ago
koeppl / stringsheet
A cheatsheet for most common Stringology tasks
☆14 · Apr 14, 2021 · Updated 5 years ago
jermp / mutable_rank_select
A SIMD-based C++ library providing rank/select queries over mutable bitmaps.
☆36 · Jan 8, 2023 · Updated 3 years ago
matthewfl / openfst-wrapper
☆28 · Jan 29, 2021 · Updated 5 years ago
jermp / psds
Efficient Prefix-Sum data structures in C++.
☆28 · Oct 1, 2023 · Updated 2 years ago
jermp / interpolative_coding
A flexible and efficient C++ implementation of the Binary Interpolative Coding algorithm.
☆31 · Jan 8, 2023 · Updated 3 years ago
thoppe / deep-phonics
Deep learning spelling patterns with a recurrent neural network
☆11 · Jun 5, 2017 · Updated 9 years ago
turpinandrew / shuff
Static Huffman coding
☆10 · Apr 3, 2017 · Updated 9 years ago
homink / kaldi-asr.forced_decoding
Perform the forced decoding with target transcription
☆11 · Sep 12, 2018 · Updated 7 years ago
kampersanda / poplar-trie
C++17 implementation of memory-efficient dynamic tries
☆58 · Feb 15, 2022 · Updated 4 years ago
WenchenLi / paper-notes
paper notes on nlp/cv/rl/dl
☆14 · May 15, 2017 · Updated 9 years ago
roberto-trani / mphf_benchmark
A Benchmark of Minimal Perfect Hash Function Algorithms.
☆39 · Dec 20, 2022 · Updated 3 years ago
jermp / mm_file
A self-contained, header-only, implementation of memory-mapped files in C++ for fast integration into larger projects.
☆40 · Aug 22, 2024 · Updated last year
COMBINE-lab / piscem-infer
☆15 · May 22, 2026 · Updated last month
mideind / Icegrams
A fast, compact trigram library for Icelandic.
☆12 · Jun 11, 2026 · Updated last month
HeChinese / OpenHeInput-Android
Full functional Chinese Input Method for Android, using HeChinese coding system.
☆11 · Aug 14, 2016 · Updated 9 years ago
jermp / sshash
📖 🧬 SSHash is a compressed, associative, exact, and weighted dictionary for k-mers.
☆107 · May 6, 2026 · Updated 2 months ago
philolo1 / bitvector
This is the official repo for the paper "A General Framework for Dynamic Succinct and Compressed Data Structures."
☆13 · Oct 26, 2016 · Updated 9 years ago
vicLeva / bqf
Implementation of a Backpack Quotient Filter
☆13 · Jun 24, 2026 · Updated 3 weeks ago
shenxiangzhuang / bleuscore
BLEU Score in Rust
☆12 · Updated this week
yoichi1484 / subspace
An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)
☆10 · May 31, 2024 · Updated 2 years ago
migumar2 / libCSD
C++ Library implementing Compressed String Dictionaries
☆47 · Apr 25, 2022 · Updated 4 years ago
TomerEven / Prefix-Filter
Prefix Filter: Practically and Theoretically Better Than Bloom.
☆50 · Sep 12, 2022 · Updated 3 years ago
speechpro / mixup
☆24 · Mar 13, 2020 · Updated 6 years ago
vicLeva / tuna
kmer-counter based on kache-hash
☆16 · Updated this week
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27 · Feb 15, 2024 · Updated 2 years ago
burrmill / burrmill
BurrMill core
☆22 · Nov 2, 2021 · Updated 4 years ago
se4u / neural_wfst
Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner
☆29 · May 11, 2019 · Updated 7 years ago
OndrejSladky / fmsi
FMSI is a highly memory efficient exact k-mer set index based on masked superstrings and the masked Burrows-Wheeler transform
☆25 · Nov 20, 2025 · Updated 8 months ago
algbio / matchtigs
Minimum plain text representation of kmer sets
☆18 · Jan 30, 2025 · Updated last year
mattsse / str-distance
String Distances in rust
☆14 · Nov 21, 2022 · Updated 3 years ago
chenzhehuai / kaldi
This is now the official location of the Kaldi project.
☆24 · Nov 13, 2019 · Updated 6 years ago
COMBINE-lab / sshash-rs
A Rust implementation of SSHash
☆19 · Jul 2, 2026 · Updated 2 weeks ago
bonitao / cmph
☆20 · Apr 30, 2026 · Updated 2 months ago
daac-tools / include-bytes-zstd
Includes a file with zstd compression in Rust
☆14 · Feb 17, 2023 · Updated 3 years ago