rainavyas / prepend_acoustic_attack
Prepend universal audio attack segment to mute Whisper
☆18 Updated last month
Alternatives and similar repositories for prepend_acoustic_attack:
Users who are interested in prepend_acoustic_attack are comparing it to the libraries listed below.
- Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking ☆32 Updated 4 months ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024 ☆13 Updated 5 months ago
- EMO-SUPERB submission ☆42 Updated 4 months ago
- ☆33 Updated last month
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words ☆48 Updated 6 months ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning ☆22 Updated last year
- ☆14 Updated 6 months ago
- This repository presents a subset of our proposed FSD dataset for song deepfake detection. ☆21 Updated 4 months ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation ☆27 Updated 4 months ago
- ARCH: Audio Representations benCHmark ☆39 Updated 4 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey". ☆109 Updated this week
- The open source code for LLM-Codec ☆120 Updated 5 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge. ☆32 Updated 11 months ago
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023. ☆22 Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆51 Updated 2 months ago
- This repository collects papers related to Speech Tokenizer. ☆15 Updated 3 months ago
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension ☆69 Updated last month
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities ☆102 Updated last month
- Survey on speech generation work. ☆15 Updated last year
- ☆39 Updated 4 months ago
- ☆19 Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024). ☆23 Updated 3 months ago
- A toolkit dedicated to speech evaluation. ☆19 Updated 3 months ago
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data" ☆44 Updated this week
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations ☆43 Updated this week
- ☆15 Updated 3 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software ☆48 Updated 2 weeks ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels ☆32 Updated 9 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature ☆70 Updated 3 months ago
- ☆23 Updated 2 years ago