Sindhu-Hegde / gestsyncLinks
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆46Updated 11 months ago
Alternatives and similar repositories for gestsync
Users that are interested in gestsync are comparing it to the libraries listed below
Sorting:
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆128Updated 4 months ago
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆52Updated 9 months ago
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆74Updated 11 months ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati…☆124Updated 5 months ago
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆84Updated last year
- ☆77Updated 3 months ago
- ☆53Updated last week
- [WACV 2025] Official implementation of "Face Anonymization Made Simple"☆181Updated last month
- ☆61Updated last year
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆37Updated 2 years ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆262Updated last week
- Combine digital painting with AI image generation.☆143Updated last month
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆85Updated last year
- repo for active speaker detection for media videos.☆28Updated last year
- ☆76Updated 10 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆50Updated 7 months ago
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated last year
- T5-based (russian) text normalization☆22Updated last year
- MBASE, an LLM SDK in C++☆52Updated last month
- KandinskyVideo — multilingual end-to-end text2video latent diffusion model☆184Updated last year
- ☆36Updated 10 months ago
- Простой IPA фонемизатор на базе ruaccent-encoder☆22Updated 3 months ago
- BLIP Live Image Captioning with Real-Time Video Stream This repository provides a Python-based implementation for real-time image captio…☆38Updated 7 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆46Updated 11 months ago
- ☆75Updated 5 months ago
- Generative Modeling with Bayesian Sample Inference☆22Updated 2 months ago
- Enhance faces in AI generated images☆46Updated last month
- Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions☆242Updated 6 months ago
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆133Updated 10 months ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆100Updated last month