ucas-hao / qwen_audio_for_add
[ACMMM2025] Official released code for ALLM4ADD
☆36Oct 30, 2025Updated 3 months ago
Alternatives and similar repositories for qwen_audio_for_add
Users interested in qwen_audio_for_add are comparing it to the libraries listed below.
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆25Nov 7, 2023Updated 2 years ago
- awesome-audio-visual-robustness☆11Jan 27, 2024Updated 2 years ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆20Jul 27, 2024Updated last year
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆54Dec 4, 2024Updated last year
- Official code of "IRNet: Iterative Refinement Network for Noisy Partial Label Learning"☆21Oct 8, 2025Updated 4 months ago
- Official code of "ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning"☆24Sep 25, 2023Updated 2 years ago
- A list of tools, papers and code related to Fake Audio Detection.☆223Dec 10, 2025Updated 2 months ago
- Official Implementation of the paper "XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection"☆36Feb 2, 2026Updated 2 weeks ago
- ☆54Nov 14, 2025Updated 3 months ago
- PyTorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"☆37Jul 24, 2023Updated 2 years ago
- ☆13Aug 28, 2024Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- [CVPR2025] From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification☆16Aug 28, 2025Updated 5 months ago
- Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking☆45Aug 23, 2024Updated last year
- The corresponding code from our paper "Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆13Jan 14, 2026Updated last month
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- ☆14Sep 17, 2024Updated last year
- Fine-tuning Llama2-7b and other llms for categorising emails for Deutsche Bahn (German National Railways)☆13Oct 9, 2023Updated 2 years ago
- [AAAI2025 Oral] BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking☆12Apr 22, 2025Updated 9 months ago
- 🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)☆17Updated this week
- ☆24Sep 11, 2025Updated 5 months ago
- Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023☆12Aug 24, 2025Updated 5 months ago
- ☆12Aug 24, 2020Updated 5 years ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆16Feb 15, 2025Updated last year
- This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…☆11Jul 11, 2023Updated 2 years ago
- Qualifying Exam Preparing☆16May 7, 2025Updated 9 months ago
- ☆11May 12, 2024Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆17Mar 3, 2025Updated 11 months ago
- Includes additional materials for the following keras.io blog post.☆12Jun 23, 2021Updated 4 years ago
- Automatic Metric for Evaluating Generated Videos☆32Dec 8, 2025Updated 2 months ago
- ☆12Oct 24, 2017Updated 8 years ago
- ☆13Jul 17, 2024Updated last year
- Code for the paper "Audio Deepfake Detection with Self-supervised XLS-R and SLS classifier"☆59Feb 7, 2025Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.☆16Oct 25, 2024Updated last year
- ICL backdoor attack☆17Nov 4, 2024Updated last year
- I fine-tuned (p-tuning) Tsinghua’s open-source large language model, ChatGLM2-6B, using several years of my WeChat chat history. Inspired…☆12Mar 6, 2024Updated last year
- ☆16Feb 8, 2024Updated 2 years ago
- SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods☆22Aug 13, 2025Updated 6 months ago