Pybind11 bindings for Kaldi
☆15 · Feb 1, 2026 · Updated last month
Alternatives and similar repositories for kalpy
Users who are interested in kalpy are comparing it to the libraries listed below
- Onset-and-Offset-Aware Sound Event Detection ☆21 · Feb 10, 2025 · Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024 ☆16 · Nov 19, 2024 · Updated last year
- Transfer learning approach to pronunciation scoring ☆12 · Jan 17, 2024 · Updated 2 years ago
- Deep Speech Distances PyTorch ☆29 · Feb 21, 2022 · Updated 4 years ago
- A Weakly Supervised Forced Alignment for disfluent speech ☆15 · Nov 12, 2023 · Updated 2 years ago
- A phonics API for the English language. ☆14 · Oct 25, 2015 · Updated 10 years ago
- ☆29 · Feb 4, 2025 · Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 8 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆13 · Feb 5, 2025 · Updated last year
- Self-Supervised Speech Pre-training and Representation Learning Toolkit. ☆10 · Feb 29, 2024 · Updated 2 years ago
- Mason-Alberta Phonetic Segmenter ☆15 · Feb 24, 2026 · Updated 3 weeks ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆43 · Mar 13, 2026 · Updated last week
- Documenting the current state of USPS Collection Boxes ☆12 · Sep 3, 2020 · Updated 5 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 · Apr 10, 2025 · Updated 11 months ago
- ☆11 · Sep 5, 2025 · Updated 6 months ago
- ☆19 · Updated this week
- a catch-all repo ☆11 · Dec 28, 2023 · Updated 2 years ago
- A Python library for the Qieyun phonological system ☆11 · Apr 1, 2025 · Updated 11 months ago
- A collection of utilities for handling IPA phones. ☆26 · Sep 24, 2023 · Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆40 · Jul 10, 2023 · Updated 2 years ago
- ☆19 · Jun 28, 2022 · Updated 3 years ago
- ☆55 · Jan 13, 2023 · Updated 3 years ago
- ☆32 · Oct 23, 2025 · Updated 4 months ago
- ☆13 · Sep 25, 2024 · Updated last year
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech ☆27 · May 25, 2023 · Updated 2 years ago
- ☆12 · Mar 11, 2025 · Updated last year
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner… ☆18 · Mar 14, 2026 · Updated last week
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- An open-source tool for automatic speech recognition (ASR) quality estimation. ☆23 · Dec 12, 2019 · Updated 6 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 · May 6, 2019 · Updated 6 years ago
- ☆19 · Jan 8, 2025 · Updated last year
- ☆13 · Oct 11, 2024 · Updated last year
- ESLTTS dataset ☆16 · Feb 6, 2025 · Updated last year
- ☆32 · Aug 22, 2024 · Updated last year
- Python/numpy/pandas convenience wrapper for the TIMIT database. ☆11 · Nov 26, 2018 · Updated 7 years ago
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch. ☆13 · Aug 26, 2020 · Updated 5 years ago
- Decoders from Kaldi using OpenFst ☆34 · Jan 29, 2026 · Updated last month
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme… ☆23 · Aug 14, 2025 · Updated 7 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆16 · Jun 16, 2024 · Updated last year