smallflyingpig / speech-to-image-translation-without-textView external linksLinks
Code for paper "direct speech-to-image translation"
☆27Jun 8, 2020Updated 5 years ago
Alternatives and similar repositories for speech-to-image-translation-without-text
Users that are interested in speech-to-image-translation-without-text are comparing it to the libraries listed below
Sorting:
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- Karras et al. (2022) diffusion models for PyTorch☆17Oct 5, 2023Updated 2 years ago
- ☆21Mar 7, 2023Updated 2 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Jan 21, 2021Updated 5 years ago
- ☆24Jun 4, 2024Updated last year
- code for paper "Universal Adversarial Perturbations Generative Network for Speaker Recognition"☆23Nov 23, 2020Updated 5 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Feb 5, 2022Updated 4 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Pytorch Code for S2IGAN☆40Aug 11, 2020Updated 5 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- ☆13Dec 13, 2022Updated 3 years ago
- Electrophysiology practicals for undergraduate students☆13Mar 8, 2021Updated 4 years ago
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Apr 22, 2021Updated 4 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 4 years ago
- [ICLR 2023] RC-MAE☆53Dec 18, 2023Updated 2 years ago
- High-Level Training, Data Augmentation, and Utilities for Pytorch☆13Mar 8, 2019Updated 6 years ago
- 关于behance爬虫项目☆10May 16, 2019Updated 6 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Jul 29, 2021Updated 4 years ago
- 팡요랩 자료☆11May 31, 2019Updated 6 years ago
- Vectorize Image Data to SVG using POTRACE. Based on multilabel-potrace by Hugo Raguet, which is based on potrace by Peter Selinger.☆15Jul 26, 2025Updated 6 months ago
- Multi-labels anime image classification in rust☆12Mar 10, 2023Updated 2 years ago
- PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+)☆10Feb 12, 2019Updated 7 years ago
- Pytorch implementation of various token mixers; Attention Mechanisms, MLP, and etc for understanding computer vision papers and other tas…☆16Oct 7, 2024Updated last year
- python3 利用用TF特征向量和Simhash指纹计算中文文本的相似度的示例☆10Dec 13, 2019Updated 6 years ago
- Dual Recursive Network for Fast Image Deraining (ICIP 2019)☆10Feb 23, 2020Updated 5 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Official implementation of Lightweight Human Pose Estimation Using Loss Weighted by Target Heatmap that was honorably mentioned as Best P…☆12Dec 17, 2023Updated 2 years ago
- ☆10Jul 20, 2020Updated 5 years ago
- Paper Review about Speech Recognition · NLP