Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking
☆47 · Aug 23, 2024 · Updated last year
Alternatives and similar repositories for AudioMarkBench
Users that are interested in AudioMarkBench are comparing it to the libraries listed below.
- TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking ☆21 · Apr 18, 2025 · Updated last year
- This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includ… ☆26 · Jan 9, 2026 · Updated 4 months ago
- AI-based Audio Watermarking Tool ☆310 · Jan 7, 2024 · Updated 2 years ago
- ☆67 · Oct 23, 2024 · Updated last year
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or… ☆17 · Jul 31, 2025 · Updated 9 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆48 · May 16, 2025 · Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS ☆24 · Sep 9, 2024 · Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding ☆52 · May 1, 2025 · Updated last year
- ☆33 · Dec 23, 2025 · Updated 5 months ago
- The demo page for ALMTokenizer ☆59 · Apr 14, 2025 · Updated last year
- Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector ☆720 · Feb 25, 2026 · Updated 2 months ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes ☆72 · Oct 8, 2025 · Updated 7 months ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers ☆125 · Mar 20, 2025 · Updated last year
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener… ☆443 · Jan 25, 2024 · Updated 2 years ago
- ☆14 · Oct 3, 2025 · Updated 7 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint ☆79 · Feb 9, 2026 · Updated 3 months ago
- [ACM MM 24] GROOT: Generating Robust Watermark for Diffusion-Model-Based Audio Synthesis ☆20 · Mar 24, 2025 · Updated last year
- ☆18 · Jan 10, 2024 · Updated 2 years ago
- Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024 ☆17 · Aug 26, 2024 · Updated last year
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆38 · Sep 9, 2025 · Updated 8 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆92 · Dec 20, 2024 · Updated last year
- ☆12 · Mar 11, 2025 · Updated last year
- ☆101 · Jan 19, 2026 · Updated 4 months ago
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation ☆165 · Nov 30, 2025 · Updated 5 months ago
- This repository collects papers related to Speech Tokenizer. ☆18 · Oct 16, 2024 · Updated last year
- ☆20 · Jul 19, 2024 · Updated last year
- Official code for SongEcho ☆63 · Mar 3, 2026 · Updated 2 months ago
- ☆14 · Jun 16, 2023 · Updated 2 years ago
- ☆92 · Jul 22, 2024 · Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 10 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model" ☆15 · Jun 28, 2024 · Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 6 months ago
- ☆149 · Feb 9, 2026 · Updated 3 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey". ☆240 · Updated this week
- A Python toolkit for data-driven HRTF research ☆16 · Feb 6, 2025 · Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆355 · Jul 21, 2025 · Updated 10 months ago
- Versatile Evaluation of Speech and Audio ☆410 · Updated this week
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,… ☆27 · Feb 27, 2026 · Updated 2 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆14 · Mar 11, 2025 · Updated last year