vinayak19th / ASR-Low-Resource
A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages
☆9Updated 4 years ago
Alternatives and similar repositories for ASR-Low-Resource:
Users that are interested in ASR-Low-Resource are comparing it to the libraries listed below
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆11Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- ☆16Updated 2 years ago
- ☆25Updated 4 months ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 5 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆12Updated 3 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 7 months ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- magicspeech competition recipe☆18Updated 4 years ago
- Transformer based ASR Engine.☆12Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Updated 6 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- ☆12Updated last month
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- A SPMI Lab toolkit for language models.☆11Updated 7 years ago
- use 3 chinese senteces as training corpus to show how to build lm model and HCLG decoding graph☆8Updated 5 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 11 months ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆38Updated 5 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago