saiteja-talluri / Speech2Face
Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL
☆178Mar 24, 2023Updated 2 years ago
Alternatives and similar repositories for Speech2Face
Users who are interested in Speech2Face are comparing it to the repositories listed below:
- A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019☆14Mar 25, 2023Updated 2 years ago
- ☆19Jun 8, 2021Updated 4 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆88Dec 8, 2022Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- [NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks☆194Jan 5, 2020Updated 6 years ago
- ☆16Apr 27, 2025Updated 9 months ago
- ☆12Jul 11, 2019Updated 6 years ago
- The project page repo for Neural Dubber.☆30Sep 20, 2023Updated 2 years ago
- Speech-Conditioned Face Generation with Deep Adversarial Networks☆134Feb 17, 2020Updated 6 years ago
- ☆15May 8, 2021Updated 4 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- SVHF-Net for Cross-modal binary matching☆32Aug 22, 2018Updated 7 years ago
- ☆967Sep 10, 2023Updated 2 years ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆130Dec 11, 2024Updated last year
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", which was accepted at EUSIPCO 2022☆15Jun 18, 2022Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain for accurate and fast decoding.☆16Jun 17, 2022Updated 3 years ago
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆613Jun 22, 2025Updated 7 months ago
- Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalize…☆774Dec 15, 2023Updated 2 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- ☆19Jul 14, 2019Updated 6 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- Out of time: automated lip sync in the wild☆870Jan 23, 2024Updated 2 years ago
- This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character me…☆1,247Aug 20, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Synthetic speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 5 months ago
- This repository provides scripts that can be used to visualize BVH files. These scripts were developed for the GENEA Challenge 2020, and …☆40Feb 23, 2023Updated 2 years ago
- ☆208Mar 10, 2021Updated 4 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆166Apr 12, 2020Updated 5 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- PyTorch CTC implementation for ASR, using eesen's FST decoder framework☆10Feb 27, 2020Updated 5 years ago
- ☆10Apr 8, 2024Updated last year
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆12Oct 9, 2024Updated last year