[ACMMM2025] Official released code for ALLM4ADD
☆36Oct 30, 2025Updated 4 months ago
Alternatives and similar repositories for qwen_audio_for_add
Users that are interested in qwen_audio_for_add are comparing it to the libraries listed below
Sorting:
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆26Nov 7, 2023Updated 2 years ago
- awesome-audio-visual-robustness☆11Jan 27, 2024Updated 2 years ago
- Official code of "IRNet: Iterative Refinement Network for Noisy Partial Label Learning"☆21Oct 8, 2025Updated 5 months ago
- Official code of "ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning"☆24Sep 25, 2023Updated 2 years ago
- Official Implementation of the paper "XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection"☆36Feb 2, 2026Updated last month
- ☆54Nov 14, 2025Updated 3 months ago
- Pytorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"☆37Jul 24, 2023Updated 2 years ago
- [CVPR2025] From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification☆17Aug 28, 2025Updated 6 months ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking☆45Aug 23, 2024Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Fine-tuning Llama2-7b and other llms for categorising emails for Deutsche Bahn (German National Railways)☆13Oct 9, 2023Updated 2 years ago
- ☆24Sep 11, 2025Updated 5 months ago
- About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆13Jan 14, 2026Updated last month
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- [AAAI2025 Oral] BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking☆13Apr 22, 2025Updated 10 months ago
- 🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)☆17Feb 13, 2026Updated 3 weeks ago
- ☆14Sep 17, 2024Updated last year
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023☆12Aug 24, 2025Updated 6 months ago
- This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…☆11Jul 11, 2023Updated 2 years ago
- Qualifying Exam Preparing☆16May 7, 2025Updated 10 months ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆16Feb 15, 2025Updated last year
- Includes additional materials for the following keras.io blog post.☆12Jun 23, 2021Updated 4 years ago
- ☆11May 12, 2024Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆17Mar 3, 2025Updated last year
- The code for the paper "ECR-Chain: Advancing Generative Language Models to Better Emotion Cause Reasoners through Reasoning Chains" (IJCA…☆12May 4, 2024Updated last year
- ☆60Jul 15, 2024Updated last year
- ☆12Oct 24, 2017Updated 8 years ago
- ☆13Jul 17, 2024Updated last year
- Automatic Metric for Evaluating Generated Videos☆33Dec 8, 2025Updated 3 months ago
- ☆16Feb 8, 2024Updated 2 years ago
- I fine-tuned (p-tuning) Tsinghua’s open-source large language model, ChatGLM2-6B, using several years of my WeChat chat history. Inspired…☆12Mar 6, 2024Updated 2 years ago
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- Code for paper "Audio Deepfake Detection with Self-supervised XLS-R and SLS classifier☆60Feb 7, 2025Updated last year
- Official repository of the paper: Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics☆15Mar 22, 2024Updated last year
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…☆22Aug 14, 2025Updated 6 months ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods☆22Aug 13, 2025Updated 6 months ago