mileskuo42 / AudioMarkBenchLinks
Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking
☆44Updated last year
Alternatives and similar repositories for AudioMarkBench
Users that are interested in AudioMarkBench are comparing it to the libraries listed below
Sorting:
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 4 months ago
- ARCH: Audio Representations benCHmark☆53Updated last year
- Official Repository for "SingFake: Singing Voice Deepfake Detection"☆63Updated last year
- TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking☆20Updated 9 months ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated 2 months ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Updated 2 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆32Updated last year
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Updated last year
- This repository presents FSD dataset for song deepfake detection.☆25Updated 5 months ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆16Updated 5 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆77Updated last week
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆45Updated 8 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated 2 years ago
- ☆35Updated last year
- ☆16Updated 5 months ago
- Official repository for U-SAM (Interspeech 2025)☆24Updated 7 months ago
- ☆78Updated 5 months ago
- ☆32Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Updated 2 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆57Updated 3 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆76Updated last year
- ☆45Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated 2 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Updated 7 months ago
- ☆24Updated 4 months ago
- ☆57Updated last year
- ☆35Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago