MontrealCorpusTools/kalpy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MontrealCorpusTools/kalpy)

MontrealCorpusTools / kalpy

Pybind11 bindings for Kaldi

☆15

Alternatives and similar repositories for kalpy

Users that are interested in kalpy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
AndreevP / speech_distances
View on GitHub
Deep Speech Distances PyTorch
☆29Feb 21, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
llm-lab-org / CLASP
View on GitHub
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13Jun 27, 2025Updated last year
MasonPhonLab / MAPS
View on GitHub
Mason-Alberta Phonetic Segmenter
☆15Feb 24, 2026Updated 4 months ago
frankyoujian / Edge-Punct-Casing
View on GitHub
☆33Feb 4, 2025Updated last year
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
nstory / collection_boxes
View on GitHub
Documenting the current state of USPS Collection Boxes
☆12Sep 3, 2020Updated 5 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
FlorinAndrei / misc
View on GitHub
a catch-all repo
☆11Dec 28, 2023Updated 2 years ago
EveryVoiceTTS / EveryVoice
View on GitHub
The EveryVoice TTS Toolkit - Text To Speech for your language
☆43Updated this week
fgnt / speaker_reassignment
View on GitHub
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆14Feb 5, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
speechcatcher-asr / speechcatcher-data
View on GitHub
☆11Sep 5, 2025Updated 10 months ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
lingjzhu / zipa
View on GitHub
A family of efficient speech models for multilingual phone recognition
☆67Updated this week
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
nk2028 / qieyun-python
View on GitHub
A Python library for the Qieyun phonological system
☆12Apr 1, 2025Updated last year
techczech / phonicsengine
View on GitHub
A phonics API for the English language.
☆16Oct 25, 2015Updated 10 years ago
zjwang21 / mix-phoneme-bert
View on GitHub
An unofficial PyTorch implementation of Mix-Phoneme-Bert
☆40Jul 10, 2023Updated 3 years ago
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
shreyas253 / SylNet
View on GitHub
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
☆27May 25, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
corticph / error-align
View on GitHub
Text-to-text alignment algorithm for speech recognition error analysis.
☆31Jun 23, 2026Updated 3 weeks ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
uasolo / FDA-DH
View on GitHub
R Code recipes for Functional Data Analysis for phonetic analysis.
☆13Jul 31, 2024Updated last year
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
MarceloSancinetti / epa-gop-pykaldi
View on GitHub
☆25Jun 14, 2022Updated 4 years ago
aalto-speech / interspeech2019_karhila_et_al
View on GitHub
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25May 6, 2019Updated 7 years ago
Auroraaa86 / LCS-CTC
View on GitHub
For IEEE ASRU(2025)
☆15Jun 21, 2025Updated last year
colinator / timit_utils
View on GitHub
Python/numpy/pandas convenience wrapper for the TIMIT database.
☆11Nov 26, 2018Updated 7 years ago
hlt-mt / TranscRater
View on GitHub
An open-source tool for automatic speech recognition ASR quality estimation.
☆24Dec 12, 2019Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
introlab / uimvdr
View on GitHub
☆13Oct 11, 2024Updated last year
avishaiElmakies / unsupervised_speech_segmentation_using_slm
View on GitHub
☆20Jan 8, 2025Updated last year
rbracco / covidcompare
View on GitHub
Project to map covid19 risk in the US
☆18Jul 19, 2020Updated 6 years ago
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
MiniXC / opensubtitles-dataloader
View on GitHub
Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.
☆13Aug 26, 2020Updated 5 years ago
Helw150 / levanter
View on GitHub
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆16Jun 16, 2024Updated 2 years ago