cvqluu / MTL-Speaker-EmbeddingsView external linksLinks
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
☆26Oct 5, 2022Updated 3 years ago
Alternatives and similar repositories for MTL-Speaker-Embeddings
Users that are interested in MTL-Speaker-Embeddings are comparing it to the libraries listed below
Sorting:
- MultiSV: scripts for data preparation☆30Jan 18, 2025Updated last year
- ☆17Jun 30, 2020Updated 5 years ago
- ☆14Jul 24, 2025Updated 6 months ago
- ☆28Dec 22, 2021Updated 4 years ago
- TDY-CNN for text-independent speaker verification☆19Nov 7, 2022Updated 3 years ago
- ☆32Sep 14, 2022Updated 3 years ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- ☆22Jun 30, 2021Updated 4 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 8 months ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Dec 18, 2021Updated 4 years ago
- Vox-Profile Benchmark☆67Updated this week
- ☆10Dec 22, 2023Updated 2 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- ☆11Jun 14, 2024Updated last year
- This repository created for the NHN ASR hackathon competition.☆11Sep 20, 2023Updated 2 years ago
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- text to speech☆10Mar 19, 2024Updated last year
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- ☆11Nov 7, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Jul 16, 2024Updated last year
- ☆15Nov 10, 2025Updated 3 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- ☆14Aug 1, 2025Updated 6 months ago
- ☆11Nov 5, 2025Updated 3 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Feb 5, 2025Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- ID R&D Voice Antispoofing Challenge Solution☆11Jul 27, 2019Updated 6 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- DysfluentWFST☆17Nov 13, 2025Updated 3 months ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated last year