mileskuo42 / AudioMarkBench
Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking
☆43 Updated last year
Alternatives and similar repositories for AudioMarkBench
Users who are interested in AudioMarkBench are comparing it to the repositories listed below.
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024). ☆25 Updated 2 months ago
- Official Repository for "SingFake: Singing Voice Deepfake Detection" ☆63 Updated last year
- ARCH: Audio Representations benCHmark ☆52 Updated last year
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,… ☆27 Updated 3 weeks ago
- TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking ☆20 Updated 7 months ago
- ☆78 Updated 3 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge. ☆32 Updated last year
- ☆61 Updated this week
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac… ☆34 Updated last year
- ☆19 Updated 2 years ago
- ☆32 Updated last year
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint ☆76 Updated 2 years ago
- SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING ☆41 Updated 2 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆62 Updated last year
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In… ☆45 Updated last year
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion. ☆55 Updated 3 weeks ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆38 Updated last year
- This repository collects papers related to Speech Tokenizer. ☆17 Updated last year
- The open source code for LLM-Codec ☆142 Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks ☆76 Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆41 Updated 6 months ago
- ☆110 Updated 3 months ago
- ☆35 Updated 2 years ago
- Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets' ☆147 Updated 8 months ago
- This repository presents FSD dataset for song deepfake detection. ☆24 Updated 3 months ago
- ☆24 Updated 3 months ago
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words ☆54 Updated last year
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch. ☆87 Updated 2 years ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning ☆23 Updated 2 years ago
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis ☆32 Updated 8 months ago