rash1993 / movie-asd
repo for active speaker detection for media videos.
☆18Updated 10 months ago
Related projects: ⓘ
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆95Updated 5 months ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆64Updated 10 months ago
- ☆35Updated 3 weeks ago
- Anim-400K: A dataset designed from the ground up for automated dubbing of video☆97Updated 3 months ago
- This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies☆30Updated 11 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆37Updated 2 months ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆26Updated 2 weeks ago
- This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assis…☆42Updated last year
- Graph learning framework for long-term video understanding☆49Updated 5 months ago
- Video shot transition detection☆21Updated last year
- The project page repo for Neural Dubber.☆27Updated last year
- Incredibly descriptive audiovisual summaries for videos☆39Updated last month
- ☆39Updated 2 months ago
- Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model☆140Updated last month
- FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and g…☆10Updated 4 months ago
- The demo page of UniAudio☆34Updated 7 months ago
- ☆11Updated 2 years ago
- ☆61Updated last month
- PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.☆169Updated last month
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆16Updated last year
- ☆14Updated last year
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆25Updated last month
- Demo for 2022 ICASSP☆64Updated 2 years ago
- ☆40Updated 2 months ago
- ☆24Updated 5 months ago
- ☆15Updated 2 years ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Updated last year
- 📖 A curated list of resources dedicated to avatar.☆51Updated last week
- ☆57Updated 2 years ago