Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
☆26 · Oct 5, 2022 · Updated 3 years ago
Alternatives and similar repositories for MTL-Speaker-Embeddings
Users who are interested in MTL-Speaker-Embeddings are comparing it to the libraries listed below
- MultiSV: scripts for data preparation ☆30 · Jan 18, 2025 · Updated last year
- ☆17 · Jun 30, 2020 · Updated 5 years ago
- ☆14 · Jul 24, 2025 · Updated 7 months ago
- ☆28 · Dec 22, 2021 · Updated 4 years ago
- TDY-CNN for text-independent speaker verification ☆19 · Nov 7, 2022 · Updated 3 years ago
- ☆32 · Sep 14, 2022 · Updated 3 years ago
- Bilingual Singing Voice Synthesis ☆18 · Mar 25, 2024 · Updated last year
- ☆22 · Jun 30, 2021 · Updated 4 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels. ☆22 · Apr 8, 2021 · Updated 4 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition. ☆21 · May 26, 2025 · Updated 9 months ago
- ☆11 · Aug 11, 2023 · Updated 2 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers. ☆12 · Nov 7, 2024 · Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data". ☆11 · Sep 18, 2023 · Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆10 · Nov 6, 2024 · Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆71 · Dec 18, 2021 · Updated 4 years ago
- ☆10 · Dec 22, 2023 · Updated 2 years ago
- This repository was created for the NHN ASR hackathon competition. ☆11 · Sep 20, 2023 · Updated 2 years ago
- ☆11 · Nov 7, 2024 · Updated last year
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- This is not remotely close to a finished product, and it neither intends nor claims to be working fine-tuning code for MaskGCT. … ☆13 · Dec 4, 2024 · Updated last year
- Example Python scripts to evaluate various ASR methods ☆11 · Dec 22, 2021 · Updated 4 years ago
- ☆11 · Jun 14, 2024 · Updated last year
- Onset-and-Offset-Aware Sound Event Detection ☆21 · Feb 10, 2025 · Updated last year
- acnn for text-independent speaker recognition ☆10 · Feb 8, 2022 · Updated 4 years ago
- Vox-Profile Benchmark ☆72 · Feb 16, 2026 · Updated 3 weeks ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆12 · Mar 11, 2025 · Updated 11 months ago
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding ☆23 · Jul 16, 2024 · Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆13 · Feb 5, 2025 · Updated last year
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates ☆12 · Mar 13, 2024 · Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation ☆10 · Nov 8, 2019 · Updated 6 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- ID R&D Voice Antispoofing Challenge Solution ☆11 · Jul 27, 2019 · Updated 6 years ago
- Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Syst… ☆13 · Feb 17, 2021 · Updated 5 years ago