shkim816 / acnn_speaker_recog
acnn for text-independent speaker recognition
☆9Updated 3 years ago
Alternatives and similar repositories for acnn_speaker_recog:
Users who are interested in acnn_speaker_recog are comparing it to the repositories listed below
- TDY-CNN for text-independent speaker verification☆17Updated 2 years ago
- ☆29Updated 3 months ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"☆31Updated 3 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 5 years ago
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆2Updated this week
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- ☆32Updated 2 years ago
- ☆29Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated last month
- ☆30Updated last year
- ☆33Updated 3 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆19Updated 5 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆31Updated 2 years ago
- ☆26Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 7 months ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆12Updated 3 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- ☆13Updated last year
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- ☆14Updated 2 years ago
- MultiSV: scripts for data preparation☆27Updated 2 months ago
- Discriminative Training of VBx Diarization☆23Updated 5 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆28Updated 2 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆24Updated 3 months ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆24Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Updated 3 years ago
- ☆10Updated last year