Diamondfan/Child-ASR-Paper

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Diamondfan/Child-ASR-Paper)

Diamondfan / Child-ASR-Paper

A list of papers for child ASR

☆54

Alternatives and similar repositories for Child-ASR-Paper

Users that are interested in Child-ASR-Paper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rishabhjain16 / whisper_child_asr
View on GitHub
☆12May 23, 2023Updated 3 years ago
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
fclearner / Personal-vad-2.0
View on GitHub
Implementation of "Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition"
☆16Jun 9, 2026Updated last month
datemoon / tf-code-acoustics
View on GitHub
it's a train acoustics model code lib
☆27May 20, 2020Updated 6 years ago
halsay / ASR-TTS-paper-daily
View on GitHub
Update ASR paper everyday
☆513May 16, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ftshijt / Interspeech2024_DiscreteSpeechChallenge
View on GitHub
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32Jan 26, 2024Updated 2 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
NiniAndy / Paraformer-V2
View on GitHub
来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition
☆29Nov 20, 2024Updated last year
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
usc-sail / child-adult-diarization
View on GitHub
public child-adult speaker diarization/classification model and codes
☆19Apr 24, 2025Updated last year
ftshijt / speech_evaluation
View on GitHub
A toolkit dedicate for speech evaluation.
☆23Sep 26, 2024Updated last year
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
bookbot-hive / k2-indonesian-asr
View on GitHub
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆16Jun 30, 2023Updated 3 years ago
TakHemlata / T-EER
View on GitHub
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
☆14Sep 25, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
BUTSpeechFIT / cgmm_mvdr_online
View on GitHub
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14Jan 14, 2022Updated 4 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
xi-j / Style-Talker
View on GitHub
An official implementation of Style-Talker for Spoken Dialogue Generation
☆23Jan 12, 2025Updated last year
lovemefan / Silero-vad-pytorch
View on GitHub
silero-vad pytorch implement
☆38Nov 23, 2024Updated last year
sarulab-speech / whisper-asr-finetune
View on GitHub
☆32Dec 4, 2022Updated 3 years ago
BUTSpeechFIT / DVBx
View on GitHub
Discriminative Training of VBx Diarization
☆28Sep 23, 2024Updated last year
MarvinLvn / BabySLM
View on GitHub
Behavioral probing of language acquisition models at the lexical and syntactic level
☆20Jul 17, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kyegomez / USM
View on GitHub
Implementation of Google's USM speech model in Pytorch
☆35Updated this week
zldzmfoq12 / VCtube
View on GitHub
A pakage for crawling audio from Youtube
☆42Aug 8, 2023Updated 2 years ago
lhwcv / self_attention_alignment
View on GitHub
Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement
☆39Jul 25, 2023Updated 3 years ago
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 3 months ago
hongwen-sun / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆15Dec 19, 2018Updated 7 years ago
keunwoochoi / tokenizer-vs-tokenizer
View on GitHub
☆14Oct 18, 2023Updated 2 years ago
jhdeov / interlingual-MFA
View on GitHub
Workflow for forced alignment between languages
☆25May 7, 2026Updated 2 months ago
yukara-ikemiya / wavefit-pytorch
View on GitHub
PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.
☆70Jul 13, 2026Updated last week
kssteven418 / Q-ASR
View on GitHub
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
☆34Oct 11, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
makerjackie / tts-frontend-dataset
View on GitHub
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆104Feb 5, 2024Updated 2 years ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
frank613 / CTC-based-GOP
View on GitHub
This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
☆41Feb 5, 2026Updated 5 months ago
NKU-HLT / SpeechLLM-as-Judges
View on GitHub
[ACL 2026]
☆25Dec 6, 2025Updated 7 months ago
dqqcasia / mosst
View on GitHub
☆27Aug 31, 2022Updated 3 years ago
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
chenpk00 / IS2024_stream_decoder_only_asr
View on GitHub
☆16Mar 12, 2024Updated 2 years ago