An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most spoken language in the world) directly to the text in English(First most spoken language).
☆18Jul 14, 2019Updated 6 years ago
Alternatives and similar repositories for End-to-End_Speech-to-Text_Translation
Users that are interested in End-to-End_Speech-to-Text_Translation are comparing it to the libraries listed below
Sorting:
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆13Feb 21, 2023Updated 3 years ago
- Offline speech recognition for Gujarati Language.☆22Dec 20, 2022Updated 3 years ago
- Transliteration module for Indian Languages☆79Oct 24, 2025Updated 4 months ago
- Fairseq tutorial☆17May 18, 2022Updated 3 years ago
- The IIT Bombay English-Hindi Parallel Corpus☆20Apr 26, 2022Updated 3 years ago
- Tracking the progress in end-to-end speech translation☆261Oct 25, 2023Updated 2 years ago
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- ☆15Jun 15, 2022Updated 3 years ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- ☆17Jul 15, 2023Updated 2 years ago
- NLP Application Project☆21May 4, 2019Updated 6 years ago
- Extract audio from a video file and summarize it using OpenAI API☆34Jul 7, 2024Updated last year
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆77Oct 22, 2024Updated last year
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding☆26Aug 12, 2021Updated 4 years ago
- Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655☆21Jul 25, 2024Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆57Apr 14, 2025Updated 11 months ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Jul 4, 2018Updated 7 years ago
- Efficient and easy to use transliteration for Indian languages☆50Aug 7, 2020Updated 5 years ago
- ☆12Sep 8, 2022Updated 3 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆27Feb 19, 2021Updated 5 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23May 19, 2021Updated 4 years ago
- Understanding angular resolvers☆13Apr 25, 2018Updated 7 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- Awesome list of WPGraphQL☆10Jun 16, 2021Updated 4 years ago
- A Lucky template bootstrapped from lucky init. Like thoughtbot/suspenders, but for lucky!☆10Sep 11, 2023Updated 2 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆59Jul 9, 2021Updated 4 years ago
- 🏳️🌈⃤ Sequelize models generator for prisma schema☆14Sep 17, 2021Updated 4 years ago
- ☆17Mar 19, 2025Updated last year
- My personal website☆12Mar 6, 2023Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- Collaborative shopping basket built with Liveblocks in React/Next.js☆15Nov 27, 2023Updated 2 years ago
- Implementation of DCTTS with Adversarial Training☆12Dec 30, 2019Updated 6 years ago
- ☆14Jul 10, 2023Updated 2 years ago
- An example of how to use parser combinators with Express for routing.☆11Nov 8, 2017Updated 8 years ago
- Searching YouTube with the YouTube Data API v3☆16Dec 9, 2018Updated 7 years ago
- ☆12Aug 24, 2022Updated 3 years ago
- ☆11Aug 19, 2016Updated 9 years ago
- Utilities for negotiating between circles and polygons in SVG☆13Apr 22, 2017Updated 8 years ago
- Awesome articles weekly 📖☆10Apr 28, 2020Updated 5 years ago