☆24Mar 13, 2020Updated 6 years ago
Alternatives and similar repositories for mixup
Users that are interested in mixup are comparing it to the libraries listed below
Sorting:
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 6 months ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- A JAX library for building lattice-based speech transducer models☆47Mar 2, 2026Updated 2 weeks ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- ☆28Dec 22, 2021Updated 4 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆28Oct 7, 2025Updated 5 months ago
- Memory efficient transducer loss computation☆70Jun 10, 2022Updated 3 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆55Jan 2, 2020Updated 6 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- MagicData-RAMC Dataset and Baseline☆58Sep 13, 2022Updated 3 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 7 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 4 months ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Mar 3, 2020Updated 6 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆167Apr 29, 2022Updated 3 years ago
- A curated list of papers dedicated to edit-distance as objective function☆53Aug 22, 2020Updated 5 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆56Sep 1, 2025Updated 6 months ago
- Instructions on downloading and using the LibriAdapt dataset☆47Aug 13, 2021Updated 4 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Jul 1, 2020Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- ☆27Jan 19, 2021Updated 5 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 4 months ago
- ☆81Feb 24, 2026Updated 3 weeks ago