[INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition"
☆18Jul 23, 2024Updated last year
Alternatives and similar repositories for voxceleb-disentangler
Users that are interested in voxceleb-disentangler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- ☆16Apr 24, 2025Updated 11 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 5 months ago
- ☆13Oct 25, 2024Updated last year
- ☆11Jun 14, 2024Updated last year
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Aug 27, 2023Updated 2 years ago
- ☆70Feb 15, 2021Updated 5 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 3 years ago
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis☆52Apr 9, 2025Updated last year
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆36Jan 6, 2026Updated 3 months ago
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- ☆11Mar 4, 2026Updated last month
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Dec 18, 2021Updated 4 years ago
- ☆17Jul 22, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆11Sep 4, 2023Updated 2 years ago
- ☆25Jul 15, 2024Updated last year
- ☆18Sep 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆28Dec 22, 2021Updated 4 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Jan 10, 2025Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- 记录图像处理相关算法openv实现☆10Jun 26, 2018Updated 7 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"☆57Nov 3, 2025Updated 5 months ago
- Visual Speech Recongnition☆20Dec 24, 2024Updated last year
- Lung Extraction from Chest X-ray for Efficient Computing☆15May 13, 2019Updated 6 years ago
- TensorFlow implementation of 'Residual Dense Network for Image Super-Resolution'☆13Mar 28, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆30Nov 7, 2023Updated 2 years ago
- ☆18Mar 13, 2024Updated 2 years ago
- TensorFlow implementation of very deep convolution network for image super-resolution☆13Mar 28, 2019Updated 7 years ago
- ☆16Feb 19, 2026Updated 2 months ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- Sound Event Detection (SED) paper collection☆17Jun 26, 2024Updated last year
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year