Code for paper "direct speech-to-image translation"
☆26Jun 8, 2020Updated 5 years ago
Alternatives and similar repositories for speech-to-image-translation-without-text
Users that are interested in speech-to-image-translation-without-text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Karras et al. (2022) diffusion models for PyTorch☆17Oct 5, 2023Updated 2 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- Pytorch Code for S2IGAN☆40Aug 11, 2020Updated 5 years ago
- code for paper "Universal Adversarial Perturbations Generative Network for Speaker Recognition"☆23Nov 23, 2020Updated 5 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Jan 21, 2021Updated 5 years ago
- ☆24Jun 4, 2024Updated last year
- Electrophysiology practicals for undergraduate students☆13Mar 8, 2021Updated 5 years ago
- Review of papers I read☆14Dec 11, 2020Updated 5 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆14Jun 6, 2023Updated 2 years ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆13Feb 21, 2023Updated 3 years ago
- Deep Neural Networks for audio classification☆11Apr 11, 2024Updated 2 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆16Apr 1, 2026Updated last month
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073☆15Dec 2, 2025Updated 5 months ago
- This is my CS 763 Computer Vision Course Project , Here we try to label Amazon Satelite Images. Here we try to implement the Show and Tel…☆12May 10, 2018Updated 8 years ago
- A GAN demo project☆13Jan 2, 2020Updated 6 years ago
- SplitSR: An End-to-End Approach to Super-Resolution on Mobile Devices (Unofficial Implementation)☆29Jan 24, 2021Updated 5 years ago
- A tool for visualization of complex job searches.☆13Jul 8, 2022Updated 3 years ago
- A board game, designed by ChatGPT and Midjourney, implemented with event-driven architecture and finite state machines☆13May 6, 2026Updated 3 weeks ago
- [FG 2019 Oral] Attribute-Guided Sketch Generation☆10Jul 25, 2021Updated 4 years ago
- What part of a song is better at determining it's music genre - the music (audio features) or the lyrics (NLP) ?☆14Jan 2, 2023Updated 3 years ago
- 中文文本近似计算☆13Jan 22, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Dec 11, 2020Updated 5 years ago
- Keras implementation of the article "Solving internal covariate shift in deep learning with linked neurons"☆13Dec 8, 2017Updated 8 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 4 years ago
- This repository contains a diverse collection of case studies and use cases commonly asked in data science interviews across different co…☆18Apr 14, 2024Updated 2 years ago
- Course Materials (along with assignments) for Intro to NLP, done as a part for requirement of the course "Introduction to NLP" (course-co…☆10Jan 2, 2023Updated 3 years ago
- ☆16Apr 27, 2025Updated last year
- a simple flight shooting game☆10Jan 17, 2016Updated 10 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Nov 13, 2019Updated 6 years ago
- ☆12Mar 3, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15May 28, 2020Updated 6 years ago
- ☆10Aug 13, 2020Updated 5 years ago
- ☆15Nov 11, 2025Updated 6 months ago
- ☆13Sep 1, 2025Updated 8 months ago
- Defect prediction of java projects using neural networks.☆15Jun 28, 2017Updated 8 years ago
- 28th place solution in Kaggle HPA classification☆30Dec 16, 2025Updated 5 months ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Jun 27, 2020Updated 5 years ago