Sindhu-Hegde / gestsync
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆46 · Updated last year
Alternatives and similar repositories for gestsync
Users who are interested in gestsync are comparing it to the repositories listed below.
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound ☆135 · Updated 8 months ago
- TraDiffusion: Trajectory-Based Training-Free Image Generation ☆55 · Updated last year
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control" ☆74 · Updated last year
- [WACV 2025] Official implementation of "Face Anonymization Made Simple" ☆195 · Updated 5 months ago
- BLIP Live Image Captioning with Real-Time Video Stream: This repository provides a Python-based implementation for real-time image captio… ☆43 · Updated 11 months ago
- This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptati… ☆129 · Updated 10 months ago
- Combine digital painting with AI image generation. ☆144 · Updated 2 months ago
- Official code for the CVPR 2024 paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language ☆85 · Updated last year
- ☆78 · Updated 7 months ago
- Video-LLaVA fine-tune for CinePile evaluation ☆51 · Updated last year
- Enhance faces in AI-generated images ☆48 · Updated 6 months ago
- LIA-X: Interpretable Latent Portrait Animator ☆91 · Updated 3 months ago
- ☆60 · Updated 3 weeks ago
- Swap your face in real-time ☆76 · Updated 8 months ago
- ☆28 · Updated last month
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆50 · Updated 10 months ago
- Text Behind Video. Enjoy, it is completely free. ☆31 · Updated 10 months ago
- ☆36 · Updated last year
- This is an open-source implementation of MixedAE (https://arxiv.org/pdf/2303.17152.pdf) ☆22 · Updated 10 months ago
- ☆61 · Updated last year
- Dense Interspecies Face Embedding (NeurIPS 2022) ☆25 · Updated 2 years ago
- font-classify ☆111 · Updated last year
- KandinskyVideo: multilingual end-to-end text2video latent diffusion model ☆184 · Updated last year
- PyTorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group ☆136 · Updated last year
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models ☆291 · Updated 2 months ago
- Official Implementation for "The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Ed… ☆107 · Updated last year
- ☆94 · Updated last year
- MBASE, an LLM SDK in C++ ☆56 · Updated 5 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆329 · Updated 2 months ago
- ☆86 · Updated last month