Sindhu-Hegde / gestsync
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆36Updated 4 months ago
Alternatives and similar repositories for gestsync:
Users who are interested in gestsync are comparing it to the repositories listed below
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆121Updated 2 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆59Updated 3 months ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆112Updated 8 months ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆79Updated 7 months ago
- Official code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆68Updated 7 months ago
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆50Updated 2 months ago
- repo for active speaker detection for media videos.☆22Updated last year
- ☆63Updated 9 months ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆23Updated last year
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆44Updated 4 months ago
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆68Updated 4 months ago
- This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assis…☆48Updated 2 years ago
- [AAAI 2024] Style2Talker - Official PyTorch Implementation☆34Updated 9 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆34Updated last month
- ☆15Updated 2 years ago
- Video-LlaVA fine-tune for CinePile evaluation☆46Updated 5 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆54Updated 5 months ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Updated 10 months ago
- Speech-driven 3D Talking Heads Generation☆59Updated last year
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆11Updated last year
- Language-Guided Face Animation by Recurrent StyleGAN-based Generator☆19Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆47Updated this week
- We present a model that can generate accurate 3D sound fields of human bodies from headset microphones and body pose as inputs.☆85Updated 7 months ago
- Implementation for the paper "Can Language Models Learn to Listen?"☆61Updated last year
- Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset.☆166Updated 3 weeks ago
- [ECCV 2024 Oral] Audio-Synchronized Visual Animation☆43Updated 4 months ago
- The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation"☆53Updated this week
- ☆24Updated last year
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆42Updated 3 months ago