imatge-upc/wav2pix

Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)

☆57

Alternatives and similar repositories for wav2pix

Users who are interested in wav2pix are comparing it to the repositories listed below.

miqueltubau / Wav2Pix
Speech-conditioned face generation using Generative Adversarial Networks
☆88 · Dec 8, 2022 · Updated 3 years ago

franroldans / tfm-franroldan-wav2pix
☆19 · Jul 14, 2019 · Updated 7 years ago

lelechen63 / ATVGnet
CVPR 2019
☆258 · May 24, 2023 · Updated 3 years ago

ravising-h / Speech2Face
Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation
☆60 · Apr 19, 2020 · Updated 6 years ago

Barbany / Multi-speaker-Neural-Vocoder
Bachelor's thesis carried out at Universitat Politecnica de Catalunya in partial fulfilment of the requirements for the degree in Telecommun…
☆16 · Jul 25, 2024 · Updated last year

cmu-mlsp / reconstructing_faces_from_voices
[NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks
☆193 · Jan 5, 2020 · Updated 6 years ago

hagerrady13 / DCGAN-PyTorch
A PyTorch Implementation of Deep Convolutional Generative Adversarial Networks
☆12 · Aug 29, 2018 · Updated 7 years ago

aqibahmad / speech2face
A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019
☆12 · Mar 25, 2023 · Updated 3 years ago

choyingw / Cross-Modal-Perceptionist
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
☆130 · Dec 11, 2024 · Updated last year

cilkim1 / speech_ani_gan
An implementation of http://openaccess.thecvf.com/content_CVPRW_2019/papers/Sight%20and%20Sound/Konstantinos_Vougioukas_End-to-End_Speech…
☆18 · Mar 19, 2020 · Updated 6 years ago

rgzn-aiyun / melgan-cpu
Real-time MelGAN running on CPU
☆13 · Dec 3, 2019 · Updated 6 years ago

b04901014 / ISGAN
☆21 · Nov 1, 2018 · Updated 7 years ago

MohammedAlghamdi / talking-heads-acm-mm
Talking Head from Speech Audio using a Pre-trained Image Generator
☆22 · May 7, 2024 · Updated 2 years ago

ShangxuanWu / CycleGAN-Face-off
Code for "CycleGAN Face-off" by Shangxuan Wu, Xiaohan Jin and Ye Qi.
☆17 · Dec 15, 2017 · Updated 8 years ago

joonson / yousaidthat
You Said That?: Synthesising Talking Faces from Audio
☆70 · Apr 29, 2018 · Updated 8 years ago

Hangz-nju-cuhk / Talking-Face-Generation-DAVS
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)
☆813 · May 11, 2021 · Updated 5 years ago

imatge-upc / speech2face
Speech-Conditioned Face Generation with Deep Adversarial Networks
☆134 · Feb 17, 2020 · Updated 6 years ago

TwistedW / tf-GANs-Loss
Loss function of various types of GANs
☆26 · Oct 5, 2018 · Updated 7 years ago

JasonSWFu / JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
☆22 · Oct 14, 2017 · Updated 8 years ago

firojalam / multimodal_social_media
multimodal social media content (text, image) classification
☆53 · Jun 22, 2022 · Updated 4 years ago

cripac-sjx / SEA-T2F
Multi-caption Text-to-Face Synthesis: Database and Algorithm
☆32 · Mar 17, 2022 · Updated 4 years ago

shkim816 / acnn_speaker_recog
acnn for text-independent speaker recognition
☆10 · Feb 8, 2022 · Updated 4 years ago

paarthneekhara / advoc
Vocode spectrograms to audio with generative adversarial networks
☆64 · Aug 8, 2019 · Updated 6 years ago

sunny8898 / DeepSpeech-tensorflow
Switches the Keras backend of Baidu's DeepSpeech from Theano to TensorFlow and integrates the Mozilla decoding module to deploy a Chinese speech recognition model
☆10 · Dec 2, 2019 · Updated 6 years ago

ruiguo-bio / colab_tension_vae
☆14 · Oct 19, 2020 · Updated 5 years ago

edezhic / fashion-generator
In-browser GPU-accelerated Generative Adversarial Network trained on Fashion-MNIST dataset (tensorflow + deeplearn.js)
☆11 · Aug 28, 2018 · Updated 7 years ago

zedix / prose-editor-element
Prose Editor is a web component wrapping TipTap 2.
☆10 · Apr 7, 2024 · Updated 2 years ago

susanqq / Talking_Face_Generation
Talking Face Generation by Conditional Recurrent Adversarial Network
☆61 · Dec 6, 2019 · Updated 6 years ago

matln / voxceleb_triplet-loss
A PyTorch implementation of triplet loss on VoxCeleb1
☆12 · Oct 16, 2019 · Updated 6 years ago

khtee / text-classification-pytorch
PyTorch implementation of RNN, CNN, BiGRU and LSTM for text classification
☆10 · Apr 30, 2021 · Updated 5 years ago

twke18 / Adversarial_Structure_Matching
Adversarial Structure Matching for Structured Prediction Tasks
☆11 · Jun 4, 2024 · Updated 2 years ago

jixinya / EVP
Code for paper 'Audio-Driven Emotional Video Portraits'.
☆314 · Mar 16, 2022 · Updated 4 years ago

eeskimez / Talking-Face-Landmarks-from-Speech
Generating Talking Face Landmarks from Speech
☆158 · Dec 22, 2022 · Updated 3 years ago

lelechen63 / Talking-head-Generation-with-Rhythmic-Head-Motion
☆209 · Mar 10, 2021 · Updated 5 years ago

Quasimondo / ComfyUI-QuasimondoNodes
A collection of various custom nodes for ComfyUI (Work in progress)
☆14 · Jun 9, 2025 · Updated last year

atharvacc / SigmaNewsProject
Stock market prediction for Kaggle-Sigma
☆14 · Mar 26, 2019 · Updated 7 years ago

nesl / asvspoof2019
Our submission to the ASVspoof 2019: Automatic Speaker Verification Spoofing and Countermeasures Challenge
☆104 · Feb 20, 2020 · Updated 6 years ago

matthijsvk / TCDTIMITprocessing
Processing and extraction of face and mouth image files from the TCDTIMIT database
☆47 · Sep 22, 2020 · Updated 5 years ago

torchgan / model-zoo
Examples of Generative Adversarial Networks built using torchgan
☆12 · Jun 11, 2019 · Updated 7 years ago