Shivam0712/End-to-End_Speech-to-Text_Translation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Shivam0712/End-to-End_Speech-to-Text_Translation)

Shivam0712 / End-to-End_Speech-to-Text_Translation

An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most spoken language in the world) directly to the text in English(First most spoken language).

☆17

Alternatives and similar repositories for End-to-End_Speech-to-Text_Translation

Users that are interested in End-to-End_Speech-to-Text_Translation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ictnlp / BT4ST
View on GitHub
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
☆11Oct 25, 2023Updated 2 years ago
krishnaKanta2008 / PredictHub
View on GitHub
PredictHub is a sophisticated stock price prediction platform that combines machine learning with real-time market data analysis. The app…
☆14Aug 15, 2025Updated 11 months ago
libindic / Transliteration
View on GitHub
Transliteration module for Indian Languages
☆79Oct 24, 2025Updated 8 months ago
DoubleN96 / AIBookingAssistant
View on GitHub
Link to the dashboard
☆13Apr 21, 2023Updated 3 years ago
MathurUtkarsh / Video-Captioning-Using-LSTM-and-Keras
View on GitHub
Generating Video Caption Using LSTM
☆12May 29, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
de9uch1 / fairseq-tutorial
View on GitHub
Fairseq tutorial
☆18May 18, 2022Updated 4 years ago
zhenghuatan / Audio-adversarial-examples
View on GitHub
Datasets of audio adversarial examples for deep speech recognition systems and Python code of a detection system
☆14May 6, 2023Updated 3 years ago
vermasrijan / Neural_Machine_Translator_seq2seq
View on GitHub
Neural Machine Translator for translating from english to hindi text. Used Pytorch framework with seq2seq architecture having Attention f…
☆13Jan 21, 2019Updated 7 years ago
kahne / SpeechTransProgress
View on GitHub
Tracking the progress in end-to-end speech translation
☆260Oct 25, 2023Updated 2 years ago
benbaker76 / FlaskGPT
View on GitHub
FlaskGPT is a minimal ChatGPT clone that uses Python, Flask, langchain and Chroma with realtime token output using SSE.
☆12Sep 30, 2023Updated 2 years ago
cristinae / ASRdys
View on GitHub
ASR for dysarthric speakers with Kaldi
☆13Jan 14, 2017Updated 9 years ago
shreydan / shakespeareGPT
View on GitHub
understanding language modeling by training a small GPT on Shakespeare plays.
☆12Feb 15, 2023Updated 3 years ago
michaelmml / NLP-Information-Extraction
View on GitHub
Automated PDF and text processing with Spacy and NLTK; information extraction from text based on grammatical structure; deployed on extra…
☆16Apr 1, 2022Updated 4 years ago
parinzee / nexus-app
View on GitHub
React-native + Fastapi + Websockets
☆12Mar 20, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mdsohelmahmood / stock-price-predict
View on GitHub
☆11Feb 7, 2021Updated 5 years ago
azadyasar / NeuralMachineTranslation
View on GitHub
PyTorch implementation of NMT models along with custom tokenizers, models, and datasets
☆21Aug 1, 2022Updated 3 years ago
YanSte / NLP-LLM-Fine-tuning-QA-LoRA-T5
View on GitHub
Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and make Chatbot Question answering (QA) with LoRA…
☆12Jan 20, 2024Updated 2 years ago
Glebzok / MAD-GAN
View on GitHub
☆12Apr 14, 2021Updated 5 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
Felix-Nilsson / gpt-internship
View on GitHub
Exploring the use case of LLMs in healthcare, in particular assisting in document retrieval and summarization.
☆12Aug 28, 2023Updated 2 years ago
johnmoses / coursera-nlp-specialization
View on GitHub
Coursera Natural Language Procession Specialization
☆14May 9, 2023Updated 3 years ago
Janie1996 / AV4SER
View on GitHub
PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition
☆12Mar 20, 2022Updated 4 years ago
IndieCoderMM / smart-one-ai
View on GitHub
🤖 AI assistant that can listen to user input and provide responses. It includes GUI to print the result and receive text input. Built wi…
☆16Dec 29, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ArushiSinghal / Neural-Machine-Translation-English-Hindi-for-domain-data
View on GitHub
NLP Application Project
☆21May 4, 2019Updated 7 years ago
xuchenneu / SATE
View on GitHub
End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding
☆26Aug 12, 2021Updated 4 years ago
m3yrin / aligned-cross-entropy
View on GitHub
Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655
☆21Jul 25, 2024Updated last year
medmac01 / healthyAI
View on GitHub
An AI based solution to help people self diagnose their health issues. Based on GPT-3 Language Model
☆18Oct 10, 2023Updated 2 years ago
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
ictnlp / NAST-S2x
View on GitHub
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
☆78Oct 22, 2024Updated last year
valu-digital / npm-packages
View on GitHub
Monorepo of open source npm packages by Valu Digital.
☆13Mar 13, 2026Updated 4 months ago
trexwithoutt / Speech-Emotion-Recognition-utterancelevel-DNN
View on GitHub
Inspired work by the project of SER using ELM at Microsoft Research
☆19Jul 4, 2018Updated 8 years ago
omise / omise-flask-example
View on GitHub
Example Flask app demonstrating the Omise payment gateway
☆19Jul 9, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ngtranminhtuan / LLMOPS
View on GitHub
NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.
☆22Mar 15, 2024Updated 2 years ago
KhanhHua2102 / Monetize.ai
View on GitHub
Monetize.ai is a web-based chatbot that provides personalized investment advice using GPT-3.5 and Yahoo Finance API. It's built using Fla…
☆17Mar 10, 2026Updated 4 months ago
jyhan03 / channel-decorrelation
View on GitHub
multi-channel target speech extraction with channel decorrelation and target speaker adaptation
☆27Feb 19, 2021Updated 5 years ago
andi611 / Conditional-SpecGAN-Tensorflow
View on GitHub
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10Dec 12, 2018Updated 7 years ago
theoomoregbee / Angular-resolvers
View on GitHub
Understanding angular resolvers
☆13Apr 25, 2018Updated 8 years ago
KrishnaDN / Keyword-Transformer
View on GitHub
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23May 19, 2021Updated 5 years ago
archinetai / cqt-pytorch
View on GitHub
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
☆73Dec 9, 2022Updated 3 years ago