hechmik / voxceleb_enrichment_age_genderView external linksLinks
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
☆71Dec 18, 2021Updated 4 years ago
Alternatives and similar repositories for voxceleb_enrichment_age_gender
Users that are interested in voxceleb_enrichment_age_gender are comparing it to the libraries listed below
Sorting:
- ☆28Dec 22, 2021Updated 4 years ago
- How to use our public wav2vec2 age and gender model☆53Sep 4, 2023Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68May 12, 2021Updated 4 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- ☆13Sep 26, 2023Updated 2 years ago
- ☆16Dec 17, 2024Updated last year
- Privacy-preserving Voice Analysis via Disentangled Representations☆11Aug 30, 2021Updated 4 years ago
- ☆53Jan 15, 2021Updated 5 years ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Learnable STRF, from Riad et al. 2021 JASA☆13Aug 21, 2021Updated 4 years ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- MSP-Podcast Challenge Baseline Code☆30Jun 12, 2024Updated last year
- ☆17Jan 26, 2021Updated 5 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software☆68Oct 17, 2024Updated last year
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 3 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated last year
- ☆12Jun 14, 2022Updated 3 years ago
- ☆16Mar 7, 2019Updated 6 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆61Jan 30, 2025Updated last year
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Jan 23, 2022Updated 4 years ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆40Jun 17, 2025Updated 8 months ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆23Sep 27, 2025Updated 4 months ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,203Feb 11, 2026Updated last week
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 7 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated 11 months ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Official Implementation of VoxTracer (MM' 23)☆11Oct 27, 2023Updated 2 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Dec 31, 2021Updated 4 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Sep 15, 2021Updated 4 years ago