aqibahmad / speech2face
A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019
☆11Updated 2 years ago
Alternatives and similar repositories for speech2face:
Users that are interested in speech2face are comparing it to the libraries listed below
- ☆50Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆23Updated last year
- ☆21Updated 3 years ago
- ☆55Updated last year
- Tools for downloading VoxCeleb2 dataset☆29Updated last year
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆110Updated 2 years ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 2 years ago
- Learning Lip Sync of Obama from Speech Audio☆67Updated 4 years ago
- Generating Talking Face Landmarks from Speech☆159Updated 2 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆159Updated 5 years ago
- This github contains the network architectures of NeuralVoicePuppetry.☆80Updated 4 years ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆129Updated 4 months ago
- An improved version of APB2Face: Real-Time Audio-Guided Multi-Face Reenactment☆82Updated 3 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 6 years ago
- MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]☆261Updated 9 months ago
- Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)☆76Updated 2 years ago
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆170Updated 2 years ago
- ☆42Updated last year
- PyTorch implementation for NED (CVPR 2022). It can be used to manipulate the facial emotions of actors in videos based on emotion labels …☆158Updated 2 years ago
- ☆29Updated 4 years ago
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization☆49Updated last month
- ☆35Updated 6 years ago
- Simple python script for downloading AVSpeech Dataset☆43Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆146Updated last year
- Code for paper 'Audio-Driven Emotional Video Portraits'.☆307Updated 3 years ago
- ☆95Updated 3 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆83Updated 3 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆66Updated last year
- This github contains the network architectures of NeuralVoicePuppetry.☆178Updated 4 years ago
- a PyTorch implementation of Lip2Wav☆50Updated 2 years ago