KoelLabs/ML

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KoelLabs/ML)

KoelLabs / ML

Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learners! This repo contains the ML training, evaluation, and data processing code

☆24

Alternatives and similar repositories for ML

Users that are interested in ML are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

changelinglab / prism
View on GitHub
A toolkit and benchmark for evaluating phonetic capabilities of speech models.
☆18Apr 10, 2026Updated 3 months ago
changelinglab / PhoneticXeus
View on GitHub
A universal phone recognizer that can transcribe speech in 70+ languages into IPA
☆24Jun 9, 2026Updated last month
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
FlorinAndrei / misc
View on GitHub
a catch-all repo
☆11Dec 28, 2023Updated 2 years ago
ai-zahran / E2E-R
View on GitHub
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆29Oct 23, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
MontrealCorpusTools / kalpy
View on GitHub
Pybind11 bindings for Kaldi
☆15Jul 11, 2026Updated last week
uasolo / FDA-DH
View on GitHub
R Code recipes for Functional Data Analysis for phonetic analysis.
☆13Jul 31, 2024Updated last year
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
Nyralei / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆12Aug 1, 2025Updated 11 months ago
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
colinator / timit_utils
View on GitHub
Python/numpy/pandas convenience wrapper for the TIMIT database.
☆11Nov 26, 2018Updated 7 years ago
frank613 / CTC-based-GOP
View on GitHub
This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
☆41Feb 5, 2026Updated 5 months ago
ssakar / tutorial
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
AInixProject / AInix
View on GitHub
Free and Open Platform for AI-assisted Computing
☆10May 19, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
datobs / react-native-perspective-image-cropper
View on GitHub
Perform custom crop, resizing and perspective correction 📐🖼
☆11May 9, 2025Updated last year
gudgud96 / torchcrepeV2
View on GitHub
My own version of crepe, SOTA pitch tracking tool in PyTorch.
☆16Feb 11, 2026Updated 5 months ago
vantezzen / quill-languagetool
View on GitHub
✒️ LanguageTool integration for Quill.js editors
☆17Aug 20, 2024Updated last year
EricWilbanks / faseAlign
View on GitHub
Command line tool for forced-alignment of Spanish speech data
☆13Dec 31, 2025Updated 6 months ago
luferrer / DCA-PLDA
View on GitHub
Discriminative Condition-Aware PLDA
☆46Jul 23, 2024Updated last year
MasonPhonLab / MAPS
View on GitHub
Mason-Alberta Phonetic Segmenter
☆15Feb 24, 2026Updated 4 months ago
crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
View on GitHub
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…
☆59May 6, 2024Updated 2 years ago
yanaiela / TNE
View on GitHub
codebase for the Text-based NP Enrichment (TNE) paper
☆19Mar 12, 2024Updated 2 years ago
asappresearch / simple-tts
View on GitHub
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
☆57Oct 31, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
otnemrasordep / ProgGP
View on GitHub
A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.
☆18Nov 19, 2024Updated last year
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
vagrawal / deepsphinx
View on GitHub
☆19Aug 27, 2018Updated 7 years ago
homink / kaldi-asr.forced_decoding
View on GitHub
Perform the forced decoding with target transcription
☆11Sep 12, 2018Updated 7 years ago
sp-uhh / sgmse_crp
View on GitHub
☆32Jan 9, 2024Updated 2 years ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
gsGupta11 / vscode-OpencvSnippets
View on GitHub
A Snippet generator for opencv.
☆10Mar 2, 2024Updated 2 years ago
YuanGongND / gopt
View on GitHub
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆217Feb 13, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
georgid / Lyrics2AudioAligner
View on GitHub
lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping
☆14Mar 14, 2018Updated 8 years ago
classifier-calibration / hands_on
View on GitHub
☆18Sep 15, 2020Updated 5 years ago
shreyas253 / SylNet
View on GitHub
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
☆27May 25, 2023Updated 3 years ago
NeuroLIAA / visions
View on GitHub
Visual Search in Natural Scenes benchmark
☆20Sep 19, 2024Updated last year
sildater / thegluenote
View on GitHub
TheGlueNote is representation model for note-wise music alignment.
☆14Jul 19, 2024Updated 2 years ago
nameless-Chatoyant / singing_voice_separation-pytorch
View on GitHub
☆13Dec 18, 2017Updated 8 years ago
shanguanma / Aligners
View on GitHub
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago