Shivam0712 / End-to-End_Speech-to-Text_Translation
An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most spoken language in the world) directly to the text in English(First most spoken language).
☆17Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for End-to-End_Speech-to-Text_Translation
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆47Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆85Updated 2 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆44Updated 5 months ago
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆82Updated 8 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆72Updated 2 years ago
- ☆41Updated last year
- Text to Speech for Indic languages☆48Updated 2 years ago
- End-to-End Speech Recognition☆10Updated 3 years ago
- ☆42Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆164Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆22Updated 10 months ago
- This project is about performing Speaker diarization for Hindi Language.☆45Updated 3 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆91Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database☆33Updated 10 months ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Updated 5 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆13Updated 4 years ago
- Code for AccentDB.☆19Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆89Updated 2 years ago
- ☆40Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- Finetune Wa2vec 2.0 For Speech Recognition☆115Updated last year
- Deep Learning model for lexical stress detection in spoken English☆27Updated 4 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆33Updated last year
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆15Updated last year
- End-to-End Mispronunciation Detection via wav2vec2.0☆42Updated 2 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆26Updated 3 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆19Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆47Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆57Updated last year
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 4 years ago