Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL
☆179Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for Speech2Face
Users that are interested in Speech2Face are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation☆62Apr 19, 2020Updated 6 years ago
- A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019☆13Mar 25, 2023Updated 3 years ago
- ☆19Jun 8, 2021Updated 4 years ago
- ☆16Apr 27, 2025Updated last year
- Speech-conditioned face generation using Generative Adversarial Networks☆88Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks☆194Jan 5, 2020Updated 6 years ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆132Dec 11, 2024Updated last year
- Speech-Conditioned Face Generation with Deep Adversarial Networks☆134Feb 17, 2020Updated 6 years ago
- SVHF-Net for Cross-modal binary matching☆32Aug 22, 2018Updated 7 years ago
- ☆56May 26, 2019Updated 6 years ago
- The project page repo for Neural Dubber.☆30Sep 20, 2023Updated 2 years ago
- Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…☆14Dec 9, 2021Updated 4 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- USB host-controller driver for Raspberry Pi☆15Sep 23, 2014Updated 11 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆19Jul 14, 2019Updated 6 years ago
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆13Aug 28, 2023Updated 2 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 8 months ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Speech to Facial Animation using GANs☆40Nov 3, 2021Updated 4 years ago
- Generating Talking Face Landmarks from Speech☆159Dec 22, 2022Updated 3 years ago
- Out of time: automated lip sync in the wild☆883Apr 17, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Jul 11, 2019Updated 6 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆57Feb 12, 2022Updated 4 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆67May 3, 2022Updated 4 years ago
- ☆25Apr 24, 2019Updated 7 years ago
- ☆962Sep 10, 2023Updated 2 years ago
- Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalize…☆775Dec 15, 2023Updated 2 years ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆962Jan 6, 2024Updated 2 years ago
- ☆27Jan 17, 2024Updated 2 years ago
- INA's library with pretrained models for gender and age prediction from faces.☆24Oct 7, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆615Jun 22, 2025Updated 10 months ago
- pytorch CTC implementation for ASR. Use eesen's fst decoder framework☆10Feb 27, 2020Updated 6 years ago
- SASV2 baseline, a track on ASVspoof5 phase2 challenge☆27Nov 12, 2025Updated 5 months ago
- This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character me…☆1,260Aug 20, 2024Updated last year
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- ☆209Mar 10, 2021Updated 5 years ago
- ☆15May 8, 2021Updated 4 years ago