frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for CMST
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Unsupervised spoken sentence embeddings☆14Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- A library for data streaming and augmentation☆20Updated 8 months ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- phone inventory library☆15Updated last year
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- ☆11Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- Code for AccentDB.☆19Updated 3 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated last year
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆48Updated 2 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- asr2k☆48Updated 5 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆11Updated 4 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆23Updated 7 months ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆23Updated last year
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- The History of Speech Recognition to the Year 2030☆11Updated 3 years ago
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago