Sindhu-Hegde / gestsync
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆46Updated 8 months ago
Alternatives and similar repositories for gestsync
Users who are interested in gestsync are comparing it to the repositories listed below
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆124Updated last month
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆51Updated 6 months ago
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆73Updated 8 months ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆120Updated 3 months ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆33Updated 2 years ago
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated 9 months ago
- Official code for the CVPR 2024 paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆77Updated 11 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆49Updated 5 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- Official repository of Wavehax vocoder☆46Updated 5 months ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆83Updated 10 months ago
- This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assis…☆52Updated 2 years ago
- [WACV 2025] Official implementation of "Face Anonymization Made Simple"☆168Updated last week
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Updated 8 months ago
- We present a model that can generate accurate 3D sound fields of human bodies from headset microphones and body pose as inputs.☆84Updated 11 months ago
- An official implementation of SwapAnyone.☆60Updated 2 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆55Updated 2 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆69Updated 7 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆143Updated 10 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆73Updated 3 weeks ago
- Repo for active speaker detection in media videos.☆26Updated last year
- A Gradio app for analyzing audio files to determine true sample rate and bit depth.☆17Updated 8 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆199Updated 3 weeks ago
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community …☆60Updated this week
- BLIP Live Image Captioning with Real-Time Video Stream. This repository provides a Python-based implementation for real-time image captio…☆36Updated 4 months ago