Evaluation script for VoxMovies dataset in PyTorch
☆23Jan 12, 2024Updated 2 years ago
Alternatives and similar repositories for VoxMovies
Users that are interested in VoxMovies are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 9 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 4 months ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆42Jan 22, 2024Updated 2 years ago
- Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns☆17Nov 15, 2022Updated 3 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- ☆95Aug 2, 2023Updated 2 years ago
- Transfer PaddlePaddle's codes to TensorLayerX's codes☆10Feb 10, 2023Updated 3 years ago
- ☆17Oct 16, 2018Updated 7 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆18Nov 19, 2025Updated 4 months ago
- Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion☆25Mar 16, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- repo for active speaker detection for media videos.☆31Nov 19, 2023Updated 2 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆75Sep 16, 2020Updated 5 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆76Aug 24, 2023Updated 2 years ago
- Website-based resource monitor for Slurm system☆37Apr 6, 2023Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Oct 9, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- Splits for epic-sounds dataset☆86Aug 2, 2025Updated 7 months ago
- ☆16Feb 19, 2026Updated last month
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆22Jan 18, 2023Updated 3 years ago
- ☆12Mar 11, 2025Updated last year
- ☆16Mar 7, 2019Updated 7 years ago
- ☆20Apr 18, 2024Updated last year
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆119Oct 9, 2023Updated 2 years ago
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer☆62Apr 20, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆32Jan 6, 2022Updated 4 years ago
- Directional sparse filtering for blind speech separation☆10Jun 8, 2021Updated 4 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆48Nov 4, 2020Updated 5 years ago
- ☆10Dec 8, 2022Updated 3 years ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated 2 months ago