Code for paper "direct speech-to-image translation"
☆26Jun 8, 2020Updated 5 years ago
Alternatives and similar repositories for speech-to-image-translation-without-text
Users that are interested in speech-to-image-translation-without-text are comparing it to the libraries listed below.
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- code for paper "Universal Adversarial Perturbations Generative Network for Speaker Recognition"☆23Nov 23, 2020Updated 5 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Jan 21, 2021Updated 5 years ago
- ☆24Jun 4, 2024Updated last year
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆10Nov 5, 2020Updated 5 years ago
- ☆10Apr 2, 2024Updated last year
- Review of papers I read☆14Dec 11, 2020Updated 5 years ago
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Jun 6, 2023Updated 2 years ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆13Feb 21, 2023Updated 3 years ago
- Deep Neural Networks for audio classification☆11Apr 11, 2024Updated last year
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆15Mar 23, 2026Updated last week
- EC499: Major Project☆10Jun 25, 2023Updated 2 years ago
- This is my CS 763 Computer Vision course project. Here we try to label Amazon satellite images. Here we try to implement the Show and Tel…☆13May 10, 2018Updated 7 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- SplitSR: An End-to-End Approach to Super-Resolution on Mobile Devices (Unofficial Implementation)☆29Jan 24, 2021Updated 5 years ago
- A tool for visualization of complex job searches.☆13Jul 8, 2022Updated 3 years ago
- [FG 2019 Oral] Attribute-Guided Sketch Generation☆10Jul 25, 2021Updated 4 years ago
- Computer Vision Models☆12Mar 1, 2023Updated 3 years ago
- What part of a song is better at determining its music genre: the music (audio features) or the lyrics (NLP)?☆14Jan 2, 2023Updated 3 years ago
- Chinese text similarity computation☆13Jan 22, 2019Updated 7 years ago
- ☆13Dec 11, 2020Updated 5 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 3 years ago
- ☆13Jul 4, 2020Updated 5 years ago
- a simple flight shooting game☆10Jan 17, 2016Updated 10 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Nov 13, 2019Updated 6 years ago
- ☆10Aug 13, 2020Updated 5 years ago
- MathBot is a transformer-based Math Word Problem (MWP) solver made as the Lab project for CSE 4622: Machine Learning Lab.☆13Jul 11, 2022Updated 3 years ago
- StammerClipper: A deep learning approach for automatic stutter detection☆12Mar 27, 2022Updated 4 years ago
- Best Collection of Articles and code for Audio Classification☆16Oct 11, 2019Updated 6 years ago
- 28th place solution in Kaggle HPA classification☆30Dec 16, 2025Updated 3 months ago
- Sublime Text's plugin for the outstanding new language Wenyan, features including syntax highlighting, building...☆13Jan 18, 2022Updated 4 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆15Mar 4, 2022Updated 4 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- Korean Speech to English Translation Corpus☆45Sep 3, 2021Updated 4 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago