frozentoad9 / CMSTLinks
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
β13Updated 2 years ago
Alternatives and similar repositories for CMST
Users that are interested in CMST are comparing it to the libraries listed below
Sorting:
- Enable RNNLM lattice rescoring with Pytorch [kaldi]β12Updated 5 years ago
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Spβ¦β13Updated 2 years ago
- β76Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downsβ¦β32Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ24Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.β26Updated last year
- bumble bee transformerβ14Updated 4 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated 2 years ago
- Online (real-time) decoder to be used with DeepSpeech2 modelβ25Updated 5 years ago
- Unsupervised spoken sentence embeddingsβ14Updated 2 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectioβ¦β36Updated 4 months ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterancesβ50Updated 11 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β45Updated 4 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.β11Updated 4 years ago
- asr2kβ52Updated last year
- β34Updated 4 years ago
- β11Updated 3 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.β10Updated 2 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"β11Updated 5 years ago
- Datasets for turn-taking researchβ15Updated last year
- Repository for fine-tuning Transformers π€ based seq2seq speech models in JAX/Flax.β37Updated 2 years ago
- Train a fiwGAN or ciwGAN model using your own training dataβ13Updated 2 years ago
- Convert words to numbersβ21Updated 3 years ago
- docker for HF wav2vec2-sprintβ13Updated 4 years ago
- A tiny BERT for low-resource monolingual modelsβ31Updated 10 months ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.β16Updated 4 years ago
- phone inventory libraryβ16Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.β12Updated 2 years ago