An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most spoken language in the world) directly to the text in English(First most spoken language).
☆18Jul 14, 2019Updated 6 years ago
Alternatives and similar repositories for End-to-End_Speech-to-Text_Translation
Users that are interested in End-to-End_Speech-to-Text_Translation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the paper "ViHateT5: Enhancing Hate Speech Detection in Vietnamese with A Unified Text-to-Text Transformer Model" (ACL'202…☆10Aug 13, 2024Updated last year
- Real-time speech to text with specific language translation.☆47Oct 9, 2020Updated 5 years ago
- Offline speech recognition for Gujarati Language.☆22Dec 20, 2022Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 2 months ago
- Deploying a Machine Learning model streaming application with Apache Kafka☆11Aug 21, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Tracking the progress in end-to-end speech translation☆260Oct 25, 2023Updated 2 years ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- ☆17Jul 15, 2023Updated 2 years ago
- ☆12Apr 14, 2021Updated 5 years ago
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition☆12Mar 20, 2022Updated 4 years ago
- Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655☆21Jul 25, 2024Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆58Apr 14, 2025Updated last year
- Inspired work by the project of SER using ELM at Microsoft Research☆19Jul 4, 2018Updated 7 years ago
- ☆12Sep 8, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Monorepo of open source npm packages by Valu Digital.☆13Mar 13, 2026Updated last month
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆27Feb 19, 2021Updated 5 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23May 19, 2021Updated 4 years ago
- Understanding angular resolvers☆13Apr 25, 2018Updated 8 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- Basic Chat Room Application☆17Jun 19, 2023Updated 2 years ago
- source code of GASNet.☆19Jan 3, 2021Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆72Dec 9, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.☆22Mar 15, 2024Updated 2 years ago
- Awesome list of WPGraphQL☆10Jun 16, 2021Updated 4 years ago
- A Lucky template bootstrapped from lucky init. Like thoughtbot/suspenders, but for lucky!☆10Sep 11, 2023Updated 2 years ago
- Python DSL that compiles element-wise expressions to parallel Rust. All CPU cores, zero serialization.☆45Mar 19, 2026Updated last month
- ☆17Mar 19, 2025Updated last year
- My personal website☆12Mar 6, 2023Updated 3 years ago
- Gold price forecasting using time series is a statistical technique that involves analyzing historical data to predict future trends in t…☆20Apr 4, 2023Updated 3 years ago
- A dashboard for managing referral in a better way☆12Jun 29, 2022Updated 3 years ago
- Supplementary material for SDM 19 paper "LSCP: Locally Selective Combination in Parallel Outlier Ensembles"☆33Dec 3, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Classify the baby cry into 8 different categories (hungry, needs burping, scared, belly pain, discomfort, cold/hot, lonely, tired).☆30Dec 6, 2023Updated 2 years ago
- Collaborative shopping basket built with Liveblocks in React/Next.js☆15Nov 27, 2023Updated 2 years ago
- Implementation of DCTTS with Adversarial Training☆12Dec 30, 2019Updated 6 years ago
- Searching YouTube with the YouTube Data API v3☆16Dec 9, 2018Updated 7 years ago
- An example of how to use parser combinators with Express for routing.☆11Nov 8, 2017Updated 8 years ago
- ☆12Aug 24, 2022Updated 3 years ago
- A collection of pre-built speech synthesis settings used to convey emotion☆11Jul 9, 2019Updated 6 years ago