PunkMale / OR-Gate
Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for OR-Gate
- CDER (Conversational Diarization Error Rate) Scoring Tool☆15Updated 2 years ago
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆22Updated 8 months ago
- ☆41Updated last year
- SRTNet☆24Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- ☆13Updated 3 months ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 3 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- PyTorch-based implementations of short-time Fourier transform☆15Updated last year
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆19Updated 3 years ago
- ☆41Updated last year
- A probabilistic scoring backend for length-normalized embeddings.☆10Updated 6 months ago
- ☆26Updated last year
- Streaming Audiotransformers for online Audio tagging☆41Updated 4 months ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆39Updated last year
- ☆13Updated 2 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆25Updated 4 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆15Updated 2 years ago
- ☆13Updated last year
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆8Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆32Updated 2 years ago
- real-time speech enhance☆12Updated 9 months ago
- ☆31Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago