JaesungHuh/VoxSRC2022

VoxSRC2022 workshop development kit

☆19

Alternatives and similar repositories for VoxSRC2022

Users who are interested in VoxSRC2022 are comparing it to the repositories listed below.

JaesungHuh / VoxMovies
Evaluation script for VoxMovies dataset in PyTorch
☆23 · Jan 12, 2024 · Updated 2 years ago
joonson / voxceleb_unsupervised
Augmentation adversarial training for self-supervised speaker recognition
☆77 · Aug 15, 2021 · Updated 4 years ago
clovaai / lookwhostalking
Look Who’s Talking: Active Speaker Detection in the Wild
☆76 · Aug 24, 2023 · Updated 2 years ago
BUTSpeechFIT / mt-asr-data-prep
☆25 · Feb 26, 2026 · Updated 5 months ago
BUTSpeechFIT / AMI-diarization-setup
☆54 · Oct 17, 2023 · Updated 2 years ago
karazijal / probable-motion
Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns
☆17 · Nov 15, 2022 · Updated 3 years ago
ductuantruong / enskd
[ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
☆16 · Mar 20, 2024 · Updated 2 years ago
a-nagrani / VoxSRC2020
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
☆43 · Jul 17, 2020 · Updated 6 years ago
karazijal / guess-what-moves
Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion
☆25 · Mar 16, 2023 · Updated 3 years ago
JaesungHuh / VoxSRC2021
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021
☆19 · Jul 21, 2021 · Updated 5 years ago
TaoRuijie / AVCleanse
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44 · Oct 31, 2022 · Updated 3 years ago
m-koichi / ConformerSED
☆31 · Mar 2, 2021 · Updated 5 years ago
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
pengzhendong / speaker-diarization
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15 · Dec 23, 2024 · Updated last year
JaesungHuh / SimpleDiarization
Simple diarization model
☆53 · Jun 13, 2025 · Updated last year
someonefighting / tf-kaldi-speaker-master
☆17 · Jun 30, 2020 · Updated 6 years ago
wngh1187 / RawNeXt
PyTorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…
☆25 · Jun 22, 2022 · Updated 4 years ago
isjwdu / DFADD
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16 · Apr 7, 2025 · Updated last year
PunkMale / OR-Gate
Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.
☆12 · Oct 23, 2023 · Updated 2 years ago
tarun360 / SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal.
☆15 · Sep 19, 2022 · Updated 3 years ago
ankitapasad / layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
☆135 · Oct 18, 2024 · Updated last year
joonson / voxconverse
Spot the conversation: speaker diarisation in the wild
☆171 · Jul 26, 2022 · Updated 4 years ago
joonson / voxsrc_2019
VoxSRC Challenge
☆31 · Jun 11, 2019 · Updated 7 years ago
ga642381 / Spoken-Dialogue-Model-Survey
A survey of spoken dialogue models (SDMs) with speech input and speech output. Focus on their Intermediate Representation and Generation …
☆31 · Mar 24, 2026 · Updated 4 months ago
SpeechClub / CDER_Metric
CDER (Conversational Diarization Error Rate) Scoring Tool
☆22 · Sep 13, 2022 · Updated 3 years ago
zcxu-eric / AVA-AVD
☆51 · Nov 24, 2022 · Updated 3 years ago
ftshijt / speech_evaluation
A toolkit dedicated to speech evaluation.
☆23 · Sep 26, 2024 · Updated last year
kaistmm / VoxMM
☆23 · May 11, 2026 · Updated 2 months ago
llm-lab-org / CLASP
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13 · Jun 27, 2025 · Updated last year
ms-dot-k / TMT
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages
☆18 · May 23, 2024 · Updated 2 years ago
LingweiMeng / Whisper-Sidecar
The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".
☆34 · Aug 2, 2025 · Updated 11 months ago
zyzisyz / mfa_conformer
☆160 · Jan 9, 2023 · Updated 3 years ago
MiukkaZh / MGT
Learning Domain-Invariant Transformation for Speaker Verification.
☆11 · Jun 13, 2023 · Updated 3 years ago
dynamic-superb / dynamic-superb
The official repository of Dynamic-SUPERB.
☆200 · Jun 24, 2025 · Updated last year
Jungjee / RawNet
Official repository for RawNet, RawNet2, and RawNet3
☆407 · Mar 21, 2024 · Updated 2 years ago
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14 · Updated this week
ICLR-DAP / Deep-Audio-Prior
Anonymous ICLR Submission
☆14 · Sep 25, 2019 · Updated 6 years ago
k2-fsa / sherpa-mlx
sherpa with mlx
☆15 · Aug 2, 2025 · Updated 11 months ago
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆16 · Mar 13, 2024 · Updated 2 years ago