Revisiting End-to-End Speech-to-Text Translation From Scratch
☆13 · Feb 21, 2023 · Updated 3 years ago
Alternatives and similar repositories for st_from_scratch
Users that are interested in st_from_scratch are comparing it to the libraries listed below.
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models ☆30 · May 31, 2022 · Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models ☆33 · Sep 2, 2022 · Updated 3 years ago
- A repository by Codechef@MUST for data structures and algorithms ☆20 · Oct 22, 2022 · Updated 3 years ago
- ☆25 · Oct 22, 2022 · Updated 3 years ago
- Offline speech recognition for Gujarati Language. ☆22 · Dec 20, 2022 · Updated 3 years ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021) ☆11 · Nov 21, 2021 · Updated 4 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 4 years ago
- Android: colouring images using the Android Native Development Kit (NDK) in C++ with the flood-fill algorithm ☆24 · Oct 18, 2023 · Updated 2 years ago
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most s… ☆18 · Jul 14, 2019 · Updated 6 years ago
- ☆25 · Feb 12, 2023 · Updated 3 years ago
- A collection of high-quality public recordings of Bach's sonatas and partitas for solo violin (BWV 1001–1006) ☆39 · Feb 19, 2022 · Updated 4 years ago
- [ICLR 2022 official code] Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness? ☆29 · Mar 15, 2022 · Updated 4 years ago
- Deep Performer: Score-to-audio music performance synthesis ☆45 · Jun 26, 2023 · Updated 2 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN ☆12 · Oct 22, 2020 · Updated 5 years ago
- ☆29 · Feb 29, 2024 · Updated 2 years ago
- ☆12 · Aug 15, 2022 · Updated 3 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus ☆41 · Feb 10, 2022 · Updated 4 years ago
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets … ☆15 · Aug 31, 2021 · Updated 4 years ago