Sindhu-Hegde / gestsyncLinks
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆46Updated 9 months ago
Alternatives and similar repositories for gestsync
Users that are interested in gestsync are comparing it to the libraries listed below
Sorting:
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆127Updated 2 months ago
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆51Updated 7 months ago
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆74Updated 9 months ago
- ☆72Updated last month
- ☆44Updated this week
- [WACV 2025] Official implementation of "Face Anonymization Made Simple"☆177Updated this week
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆122Updated 4 months ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆36Updated 2 years ago
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆80Updated last year
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆49Updated 6 months ago
- Combine digital painting with AI image generation.☆140Updated last week
- BLIP Live Image Captioning with Real-Time Video Stream This repository provides a Python-based implementation for real-time image captio…☆37Updated 5 months ago
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated 10 months ago
- ☆64Updated last year
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆84Updated last year
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation☆171Updated last month
- An official implementation of SwapAnyone.☆62Updated 3 months ago
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆12Updated 2 years ago
- ☆36Updated 9 months ago
- ☆21Updated 3 months ago
- Vico: Compositional Video Generation as Flow Equalization☆57Updated 7 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆48Updated 2 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆56Updated 2 months ago
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community …☆60Updated this week
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆133Updated 8 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆62Updated 3 months ago
- ☆170Updated 2 months ago
- repo for active speaker detection for media videos.☆27Updated last year
- ☆27Updated 10 months ago
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆52Updated 9 months ago