VoxSRC2022 workshop development kit
☆19Jul 21, 2022Updated 3 years ago
Alternatives and similar repositories for VoxSRC2022
Users that are interested in VoxSRC2022 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆77Aug 15, 2021Updated 4 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆76Aug 24, 2023Updated 2 years ago
- ☆55Oct 17, 2023Updated 2 years ago
- Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns☆17Nov 15, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion☆25Mar 16, 2023Updated 3 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021☆18Jul 21, 2021Updated 4 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- ☆31Mar 2, 2021Updated 5 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Simple diarization model☆54Jun 13, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- ☆17Jun 30, 2020Updated 5 years ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Jun 22, 2022Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated last year
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆166Jul 26, 2022Updated 3 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆133Oct 18, 2024Updated last year
- VoxSRC Challenge☆31Jun 11, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 11 months ago
- ☆51Nov 24, 2022Updated 3 years ago
- ☆22May 11, 2026Updated 2 weeks ago
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆19May 23, 2024Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- ☆160Jan 9, 2023Updated 3 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Sep 13, 2022Updated 3 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official repository of Dynamic-SUPERB.☆200Jun 24, 2025Updated 11 months ago
- Official repository for RawNet, RawNet2, and RawNet3☆406Mar 21, 2024Updated 2 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Dec 19, 2018Updated 7 years ago
- Collection of scripts from mHuBERT-147.☆35Nov 19, 2024Updated last year
- [CVPR2025] Official code for Lost in Translation Found in Context☆24Jan 14, 2026Updated 4 months ago