Sindhu-Hegde / gestsync
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆45 · Updated 7 months ago
Alternatives and similar repositories for gestsync:
Users who are interested in gestsync are comparing it to the repositories listed below.
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound ☆121 · Updated 3 weeks ago
- TraDiffusion: Trajectory-Based Training-Free Image Generation ☆50 · Updated 5 months ago
- The implementation of "An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control" ☆73 · Updated 7 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization ☆48 · Updated 4 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆48 · Updated 2 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆52 · Updated last month
- Official code for the CVPR 2024 paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language ☆74 · Updated 10 months ago
- An official implementation of SwapAnyone. ☆59 · Updated last month
- Vico: Compositional Video Generation as Flow Equalization ☆58 · Updated 5 months ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati… ☆118 · Updated 2 months ago
- FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and g… ☆13 · Updated 11 months ago
- [WACV 2025] EmoVOCA: Speech-Driven Emotional 3D Talking Heads ☆20 · Updated 2 months ago
- Interactive Video Generation via Masked-Diffusion ☆80 · Updated last year
- BLIP Live Image Captioning with Real-Time Video Stream. This repository provides a Python-based implementation for real-time image captio… ☆33 · Updated 3 months ago
- [AAAI 2024] Style2Talker - Official PyTorch Implementation ☆38 · Updated last year
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder ☆61 · Updated 9 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆70 · Updated this week
- [CVPR 2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation ☆40 · Updated 2 weeks ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation ☆53 · Updated this week
- Video-LLaVA fine-tune for CinePile evaluation ☆51 · Updated 8 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆68 · Updated 6 months ago
- Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset. ☆180 · Updated last week
- Text and image to video generation: Kandinsky 4.0 (2024) ☆144 · Updated 4 months ago
- Code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection ☆30 · Updated last year