PunkMale / OR-Gate
Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.
☆11Updated last year
Alternatives and similar repositories for OR-Gate:
Users that are interested in OR-Gate are comparing it to the libraries listed below
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- ☆16Updated 4 months ago
- ☆18Updated 7 months ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 10 months ago
- ☆11Updated last year
- ☆30Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated 7 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- ☆43Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- ☆36Updated 2 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆28Updated 9 months ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆16Updated 9 months ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆21Updated last year
- Self-supervised Speaker Diarization Interspeech 2022 Implementation☆9Updated 6 months ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆16Updated 3 weeks ago
- Speechflow for emotion recognition related information decomposition☆10Updated 3 years ago
- ☆43Updated 2 years ago
- ☆25Updated 5 months ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated 2 years ago
- ☆30Updated 5 months ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆2Updated last month
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Updated 4 years ago
- faster inference☆28Updated 3 months ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- Speech samples and code of BEdit-TTS☆32Updated last year