Sindhu-Hegde / gestsync
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆44Updated 7 months ago
Alternatives and similar repositories for gestsync:
Users that are interested in gestsync are comparing it to the libraries listed below
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆121Updated this week
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆50Updated 4 months ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆118Updated last month
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆71Updated 7 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆69Updated 5 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆51Updated 2 weeks ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆81Updated 9 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated last month
- A Gradio app for analyzing audio files to determine true sample rate and bit depth.☆15Updated 6 months ago
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆72Updated 9 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆47Updated 3 months ago
- Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset..☆172Updated 3 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆38Updated last year
- A new one shot head swapping approach☆64Updated last month
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Updated 6 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆140Updated 8 months ago
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆47Updated 6 months ago
- An official implementation of SwapAnyone.☆56Updated 2 weeks ago
- ☆24Updated last year
- ☆63Updated last year
- ☆3Updated 6 months ago
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated 7 months ago
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization☆48Updated last week
- [WACV 2025] Official implementation of "Face Anonymization Made Simple"☆167Updated 2 months ago
- Interactive Video Generation via Masked-Diffusion☆79Updated 11 months ago
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community …☆60Updated 2 weeks ago
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆11Updated last year
- ☆30Updated 4 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆143Updated 3 months ago
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆21Updated 6 months ago