repo for active speaker detection for media videos.
☆31Nov 19, 2023Updated 2 years ago
Alternatives and similar repositories for movie-asd
Users that are interested in movie-asd are comparing it to the libraries listed below.
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆16Aug 26, 2020Updated 5 years ago
- Graph learning framework for long-term video understanding☆72Jul 13, 2025Updated 9 months ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆172Mar 23, 2025Updated last year
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆59May 29, 2023Updated 2 years ago
- Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset☆72Jan 18, 2022Updated 4 years ago
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆467Oct 23, 2023Updated 2 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆63Jan 24, 2024Updated 2 years ago
- [WACV 2026] LASER: Lip Landmark Assisted Speaker Detection for Robustness official implementation☆26Feb 26, 2026Updated 2 months ago
- ☆21May 2, 2026Updated last week
- The repository for Springer IJCV 2025 (LR-ASD: Lightweight and Robust Network for Active Speaker Detection)☆113Mar 23, 2025Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- ☆15Feb 22, 2025Updated last year
- Datasets of audio adversarial examples for deep speech recognition systems and Python code of a detection system☆15May 6, 2023Updated 3 years ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆68Oct 29, 2023Updated 2 years ago
- [NeurIPS'22] Official Repository for Characterizing Datapoints via Second-Split Forgetting☆16Aug 11, 2023Updated 2 years ago
- Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering☆11Feb 16, 2023Updated 3 years ago
- ☆13Aug 28, 2018Updated 7 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated last year
- Add Rain Streak Mask On Unpaired Image Using GAN☆10Sep 12, 2020Updated 5 years ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆48Sep 1, 2024Updated last year
- Codebase for "Channel selection using Gumbel Softmax"☆19Jan 20, 2021Updated 5 years ago
- An online video editor created with ability to animate and export videos on the web!☆12Mar 6, 2023Updated 3 years ago
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enh…☆17Dec 31, 2024Updated last year
- Thermal Indoor Motion Dataset☆16Apr 27, 2023Updated 3 years ago
- A python package of robust and effective defogging/dehazing method☆15Dec 30, 2018Updated 7 years ago
- ☆34Jun 2, 2023Updated 2 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆20Oct 17, 2024Updated last year
- Home of https://scapy.net☆13Apr 10, 2026Updated last month
- ☆83Mar 10, 2025Updated last year
- Cascade of CNNs for Robust Facial Landmarks Detection☆15Jan 29, 2021Updated 5 years ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- ☆21Nov 30, 2019Updated 6 years ago
- Netflix JavaScript API☆18Sep 29, 2017Updated 8 years ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆22Apr 3, 2024Updated 2 years ago
- Python (pip) package for fitting mixtures of Student's t-distributions using either maximum likelihood (EM) or Bayesian methodology (vari…☆11Sep 23, 2025Updated 7 months ago
- A curated list of Story Ending Generation models; DASFAA'22: Incorporating Commonsense Knowledge into Story Ending Generation via Heterog…☆14May 12, 2022Updated 3 years ago
- Kaggle Cats vs. Dogs Redux Edition☆21Mar 11, 2017Updated 9 years ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year