TangciuYueng / AMemGuardLinks
☆24Updated last month
Alternatives and similar repositories for AMemGuard
Users that are interested in AMemGuard are comparing it to the libraries listed below
Sorting:
- Prepend universal audio attack segment to mute Whisper☆29Updated 10 months ago
- ☆29Updated 3 months ago
- SafeEar是由浙大和清华共同开发的一种深度伪声探测模型。这是我撰写的模型推理脚本。我不确定它是否正确,目前我还是 初学者,如有问题请原谅我并指出,谢谢!☆12Updated 6 months ago
- ☆118Updated 2 months ago
- ☆70Updated 2 months ago
- ☆11Updated 7 months ago
- ☆19Updated 5 months ago
- Chinese-Mimi 是对 Moshi 模型的声码器进行了中文语料上的适配。☆31Updated 8 months ago
- Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking☆43Updated last year
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆84Updated 2 months ago
- [ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms☆29Updated 5 months ago
- ☆19Updated 2 years ago
- [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix☆179Updated 5 months ago
- [ICLR 2025] SONICS: Synthetic Or Not - Identifying Counterfeit Songs☆39Updated 6 months ago
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆111Updated 3 weeks ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆45Updated 3 months ago
- ☆49Updated 3 months ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆30Updated 8 months ago
- Data Pipeline, Models, and Benchmark for Omni-Captioner.☆92Updated last month
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Updated last week
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆94Updated 4 months ago
- ☆108Updated last month
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆53Updated last year
- ☆49Updated last year
- Official code implementation of SKU, Accepted by ACL 2024 Findings☆20Updated 11 months ago
- The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc…☆15Updated 9 months ago
- The official implementation of ImageBind-LLM and Whisper-LLM from the paper "Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Compre…☆21Updated 2 years ago
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.☆35Updated 4 months ago
- A comprehensive framework to test audio comprehension of Large Audio Language Models.☆56Updated this week
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆121Updated 2 months ago