Sindhu-Hegde / gestsyncLinks
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆46Updated 9 months ago
Alternatives and similar repositories for gestsync
Users that are interested in gestsync are comparing it to the libraries listed below
Sorting:
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆124Updated 2 months ago
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆51Updated 6 months ago
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆73Updated 9 months ago
- ☆70Updated last month
- An official implementation of SwapAnyone.☆62Updated 2 months ago
- Official PyTorch implementation of TokenSet.☆121Updated 2 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆58Updated 2 months ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆121Updated 3 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 6 months ago
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆78Updated 11 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆48Updated last month
- ☆26Updated 9 months ago
- ☆23Updated last year
- We present a model that can generate accurate 3D sound fields of human bodies from headset microphones and body pose as inputs.☆84Updated last year
- ☆18Updated 2 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆74Updated last month
- ☆36Updated 8 months ago
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated 9 months ago
- ☆60Updated last year
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆49Updated 5 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆69Updated 8 months ago
- Official Implementation of GrounDiT (NeurIPS 2024)☆53Updated 5 months ago
- Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)☆62Updated 4 months ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆33Updated 2 years ago
- [CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.☆53Updated 2 months ago
- Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset..☆184Updated last month
- A Gradio app for analyzing audio files to determine true sample rate and bit depth.☆17Updated 8 months ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆66Updated 4 months ago
- BLIP Live Image Captioning with Real-Time Video Stream This repository provides a Python-based implementation for real-time image captio…☆36Updated 5 months ago