Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking
☆45 · Aug 23, 2024 · Updated last year
Alternatives and similar repositories for AudioMarkBench
Users that are interested in AudioMarkBench are comparing it to the libraries listed below.
- TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking ☆21 · Apr 18, 2025 · Updated 11 months ago
- AI-based Audio Watermarking Tool ☆306 · Jan 7, 2024 · Updated 2 years ago
- ☆65 · Oct 23, 2024 · Updated last year
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or… ☆17 · Jul 31, 2025 · Updated 7 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆47 · May 16, 2025 · Updated 10 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS ☆24 · Sep 9, 2024 · Updated last year
- ☆33 · Dec 23, 2025 · Updated 3 months ago
- The demo page for ALMTokenizer ☆59 · Apr 14, 2025 · Updated 11 months ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes ☆58 · Oct 8, 2025 · Updated 5 months ago
- Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector ☆691 · Feb 25, 2026 · Updated 3 weeks ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers ☆125 · Mar 20, 2025 · Updated last year
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener… ☆442 · Jan 25, 2024 · Updated 2 years ago
- ☆14 · Oct 3, 2025 · Updated 5 months ago
- Unofficial SoundStream implementation in PyTorch with training code and 16kHz pretrained checkpoint ☆78 · Feb 9, 2026 · Updated last month
- [ACM MM 24] GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis ☆20 · Mar 24, 2025 · Updated last year
- ☆18 · Jan 10, 2024 · Updated 2 years ago
- Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024 ☆17 · Aug 26, 2024 · Updated last year
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆36 · Sep 9, 2025 · Updated 6 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding ☆50 · May 1, 2025 · Updated 10 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆90 · Dec 20, 2024 · Updated last year
- This repository collects papers related to Speech Tokenizer. ☆17 · Oct 16, 2024 · Updated last year
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation ☆156 · Nov 30, 2025 · Updated 3 months ago
- ☆12 · Mar 11, 2025 · Updated last year
- ☆100 · Jan 19, 2026 · Updated 2 months ago
- ☆20 · Jul 19, 2024 · Updated last year
- Official code for SongEcho ☆52 · Mar 3, 2026 · Updated 3 weeks ago
- ☆14 · Jun 16, 2023 · Updated 2 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 8 months ago
- ☆75 · Jul 22, 2024 · Updated last year
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model" ☆15 · Jun 28, 2024 · Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 4 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey". ☆219 · Mar 17, 2026 · Updated last week
- A Python toolkit for data-driven HRTF research ☆16 · Feb 6, 2025 · Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆348 · Jul 21, 2025 · Updated 8 months ago
- Versatile Evaluation of Speech and Audio ☆394 · Dec 9, 2025 · Updated 3 months ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,… ☆27 · Feb 27, 2026 · Updated 3 weeks ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆13 · Mar 11, 2025 · Updated last year
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆60 · Oct 23, 2024 · Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 · Oct 20, 2024 · Updated last year