cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with PyTorch [kaldi]
☆12Updated 4 years ago
Related projects:
- A library of speech gadgets.☆13Updated last year
- This repo contains the baseline model recipes and pre-trained model for the Gram Vaani Hindi ASR Challenge☆14Updated 2 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago
- Readers that enable reading Kaldi ark files in TensorFlow☆17Updated 6 years ago
- In this repository, I try to combine k2 with SpeechBrain for accurate and fast decoding.☆16Updated 2 years ago
- Unsupervised spoken sentence embeddings☆14Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 3 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆11Updated last month
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆34Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 6 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆8Updated 2 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Updated 9 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Updated 3 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆10Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆16Updated 6 months ago
- Phone inventory library☆14Updated last year