skhu101 / Bayesian_TDNN

This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP).

☆9

Alternatives and similar repositories for Bayesian_TDNN

Users who are interested in Bayesian_TDNN are comparing it to the repositories listed below.

luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and quickly.
☆16 Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 Updated 3 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆13 Updated 2 years ago
qiujiali / lattice-rescore
☆16 Updated 3 years ago
rhoposit / icassp2021
☆15 Updated 4 years ago
ttslr / MonTTS
☆13 Updated 3 years ago
speechio / asr-noises
A handy dataset of noises for ASR
☆22 Updated 6 years ago
csukuangfj / kaldi-hmm-gmm
☆25 Updated 9 months ago
thu-spmi / SPMILM
A SPMI Lab toolkit for language models.
☆11 Updated 8 years ago
wenet-e2e / WeSpeech-AI
Open Source Speech/Text Data on AI
☆18 Updated 2 years ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 Updated last year
NeuroWave-ai / CUCVAE-TTS
☆25 Updated 3 years ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆14 Updated 8 months ago
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14 Updated 9 months ago
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12 Updated 11 months ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17 Updated 10 months ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for the GramVaani Hindi ASR challenge.
☆15 Updated 3 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31 Updated 3 years ago
BUTSpeechFIT / TS_SUPERB
☆15 Updated 4 months ago
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22 Updated last year
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆24 Updated last month
p1an-lin-jung / wv_tts
☆19 Updated last year
idiap / zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆21 Updated last year
exercise-book-yq / FreeCodec
FreeCodec: A Disentangled Neural Speech Codec with Fewer Tokens
☆22 Updated 11 months ago
ljuvela / GELP
☆26 Updated 4 years ago
wentaozhu / speechnas
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30 Updated 2 years ago
alumae / torch-xvectors-wav
☆22 Updated 4 years ago
babe269 / performant
A toolset for easy formant extraction and visualization from wav files and TTS models
☆31 Updated 2 years ago
WangHelin1997 / Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…
☆33 Updated last year
cyhuang-tw / robust-vc
☆11 Updated 3 years ago