sivannavis / samo
SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING
☆39Updated 2 years ago
Alternatives and similar repositories for samo:
Users that are interested in samo are comparing it to the libraries listed below
- US-based professors who work on audio. For students who would like to apply for RA, PhD, postdoc in audio research.☆25Updated 3 weeks ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 6 months ago
- ☆30Updated last year
- Streaming Audiotransformers for online Audio tagging☆44Updated 10 months ago
- Official Repository for "SingFake: Singing Voice Deepfake Detection"☆53Updated last year
- ☆49Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- ☆30Updated 5 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- ☆17Updated last week
- ☆33Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated last month
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆26Updated last month
- Speech Human Evaluation Estimation Toolkit (SHEET)☆65Updated 5 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆32Updated 8 months ago
- ☆24Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆31Updated last month
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆22Updated last year
- ☆43Updated 10 months ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- TODO☆38Updated last year
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆2Updated last month
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆54Updated 3 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆47Updated last week
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 3 months ago
- Viterbi decoding in PyTorch☆30Updated 3 weeks ago
- ☆17Updated last year
- ☆48Updated 3 weeks ago
- LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…☆16Updated 2 years ago
- ☆48Updated 7 months ago