awsaf49 / sonics
[ICLR 2025] SONICS: Synthetic Or Not - Identifying Counterfeit Songs
☆12Updated last month
Alternatives and similar repositories for sonics:
Users who are interested in sonics are comparing it to the repositories listed below:
- PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 4 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 8 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated 7 months ago
- ☆24Updated last year
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated last year
- ESLTTS dataset☆16Updated 2 months ago
- ViSpeR: Multilingual Audio-Visual Speech Recognition☆33Updated last week
- A toolkit for researchers in multimodal sound separation.☆16Updated last year
- ☆10Updated 3 years ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆52Updated 5 months ago
- Robust Neural Audio Watermarking with Invertible Dual-Embedding☆21Updated 5 months ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆22Updated 2 months ago
- ☆19Updated 2 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆16Updated 2 months ago
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- ☆23Updated 9 months ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆38Updated last year
- Interface Design for Self-Supervised Speech Models, accepted to Interspeech 2024☆16Updated 5 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆24Updated last month
- Source code for the paper 'Audio Captioning Transformer'☆54Updated 3 years ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆52Updated 6 months ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆63Updated 2 months ago
- SRTNet☆24Updated 2 years ago
- PyTorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 7 months ago
- SSL Layerwise analysis for speech deepfake detection☆22Updated 2 months ago