Prepend universal audio attack segment to mute Whisper
☆39Jan 22, 2025Updated last year
Alternatives and similar repositories for prepend_acoustic_attack
Users that are interested in prepend_acoustic_attack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jun 11, 2024Updated 2 years ago
- ☆25Dec 20, 2022Updated 3 years ago
- ☆23Apr 3, 2025Updated last year
- ☆16Oct 18, 2023Updated 2 years ago
- This repository follows papers and reports on discrete speech representation learning and speech tokenization methods for speech language…☆15Dec 1, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23May 19, 2026Updated last month
- Code of paper "AdvReverb: AdvReverb: Rethinking the Stealthiness of Audio Adversarial Examples to Human Perception"☆21Nov 26, 2023Updated 2 years ago
- Re-implementation of SLAM-ASR paper's experiment, using Phi-2 and Hubert☆21Jun 14, 2024Updated 2 years ago
- [USENIX Security 2025] SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis☆37May 24, 2025Updated last year
- ☆14Apr 6, 2025Updated last year
- ☆25Feb 14, 2024Updated 2 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆17Dec 1, 2022Updated 3 years ago
- [MICCAI 2024] MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality☆14Sep 26, 2025Updated 9 months ago
- A unified robotic manipulation learning framework☆23Sep 4, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Sep 6, 2024Updated last year
- ☆13Aug 26, 2024Updated last year
- ☆29Sep 5, 2024Updated last year
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Aug 20, 2023Updated 2 years ago
- ☆15Mar 3, 2025Updated last year
- Offical Repository of MetaAgent Program☆52Dec 2, 2025Updated 7 months ago
- ☆20Sep 2, 2024Updated last year
- Official implementation of the paper: "NeoBabel: A Multilingual Open Tower for Visual Generation"☆23Aug 4, 2025Updated 10 months ago
- ☆18Apr 6, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code repository for Ensemble-based Blackbox Attacks on Dense Prediction (EBAD), CVPR 2023☆28May 17, 2024Updated 2 years ago
- Illustrating EM for GMMs and HMMs☆12May 9, 2020Updated 6 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆29Jun 6, 2022Updated 4 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆23Oct 19, 2023Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- ☆13Jan 5, 2025Updated last year
- ☆15Dec 18, 2024Updated last year
- Code for MICCAI2023 paper: TransLiver: A Hybrid Transformer Model for Multi-phase Liver Lesion Classification☆18Jan 10, 2024Updated 2 years ago
- The official implementation of the paper "Defending Your Voice: Adversarial Attack on Voice Conversion".☆53May 15, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)☆39Jul 24, 2025Updated 11 months ago
- The implementation of our NeurIPS 2024 paper "DarkSAM: Fooling Segment Anything Model to Segment Nothing".☆14Nov 4, 2024Updated last year
- ☆11Jul 14, 2023Updated 2 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆21Sep 25, 2023Updated 2 years ago
- Official repository for GraFPrint: an audio identification framework based on graph neural networks.☆41Sep 18, 2025Updated 9 months ago
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆84Jun 11, 2026Updated 3 weeks ago
- Auditing agents for fine-tuning safety☆21Oct 21, 2025Updated 8 months ago