Sindhu-Hegde / gestsync
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆43Updated 5 months ago
Alternatives and similar repositories for gestsync:
Users that are interested in gestsync are comparing it to the libraries listed below
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆121Updated 3 months ago
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆50Updated 3 months ago
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆69Updated 5 months ago
- Video-LlaVA fine-tune for CinePile evaluation☆47Updated 6 months ago
- BLIP Live Image Captioning with Real-Time Video Stream This repository provides a Python-based implementation for real-time image captio…☆24Updated last month
- ☆35Updated 4 months ago
- A Gradio app for analyzing audio files to determine true sample rate and bit depth.☆15Updated 5 months ago
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆11Updated last year
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆112Updated last week
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆69Updated 8 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆41Updated 2 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆44Updated 5 months ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆25Updated last year
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆16Updated 4 months ago
- ☆26Updated 7 months ago
- ☆63Updated 10 months ago
- This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assis…☆49Updated 2 years ago
- [WACV 2025] Official implementation of "Face Anonymization Made Simple"☆157Updated last month
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M…☆23Updated 3 weeks ago
- Official repository of Wavehax vocoder☆46Updated 2 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 5 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆45Updated 4 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆64Updated 4 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆55Updated 7 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆36Updated 10 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆62Updated 4 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆55Updated last month
- ☆24Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆47Updated last week