Ming-er/MGA-CLAP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ming-er/MGA-CLAP)

Ming-er / MGA-CLAP

official implementation of MGA-CLAP (ACM MM 2024)

☆29

Alternatives and similar repositories for MGA-CLAP

Users that are interested in MGA-CLAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Ming-er / LGC-SED
View on GitHub
☆13Jan 3, 2024Updated 2 years ago
Heisenburger2020 / Vabs-Net
View on GitHub
Vabs-Net: Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains
☆17Sep 12, 2024Updated last year
Ming-er / Audio-Free-P-Tuning
View on GitHub
☆11Dec 28, 2023Updated 2 years ago
Ruoyu-Xu / SCKD
View on GitHub
AAAI 2025
☆17Dec 13, 2024Updated last year
pFindStudio / pUniFind
View on GitHub
[Nature Machine Intelligence], the first open de novo sequencing and open database search rescoring deep learning model.
☆43May 30, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
soham97 / ADIFF
View on GitHub
Explaining audio differences using language
☆16Feb 11, 2025Updated last year
AweAI-Team / DeNovoSWE
View on GitHub
Scaling Long-Horizon SWE Environments
☆42Updated this week
WWWWxp / Speech-Tokenizer-Papers
View on GitHub
This repository collects papers related to Speech Tokenizer.
☆18Oct 16, 2024Updated last year
wsntxxn / TextToAudioGrounding
View on GitHub
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49Oct 23, 2025Updated 9 months ago
Chen-GX / ReForm
View on GitHub
☆21Jan 31, 2026Updated 5 months ago
wdqqdw / Echo
View on GitHub
Project page of "2026-ICLR Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning"
☆16Mar 26, 2026Updated 3 months ago
inclusionAI / AudioMCQ
View on GitHub
[ICLR 2026] AudioMCQ: A 571k audio multiple-choice question dataset for post-training Large Audio Language Models with dual CoT annotatio…
☆51Apr 21, 2026Updated 3 months ago
CPJKU / cpjku_dcase24
View on GitHub
☆29Oct 17, 2024Updated last year
FrontierLabs / F5R-TTS
View on GitHub
Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"
☆169Mar 3, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
john852517791 / pytorch_lightning_FAD
View on GitHub
This is a general framework for fake audio detection using pytorch lightning
☆30Jul 24, 2025Updated last year
xiaomi-research / dasheng-glap
View on GitHub
Official Implementation of GLAP - General Language Audio Pretraining
☆74May 14, 2026Updated 2 months ago
Chen-GX / SEER
View on GitHub
☆15Feb 10, 2025Updated last year
Sreyan88 / CompA
View on GitHub
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23Jul 10, 2024Updated 2 years ago
AweAI-Team / BeyondSWE
View on GitHub
☆47Updated this week
xieyuankun / All-Type-ADD
View on GitHub
This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”
☆34Mar 31, 2026Updated 3 months ago
Xia-aaa / L3former
View on GitHub
☆14Jun 26, 2025Updated last year
Yuto-Matsunaga / Prompt_Tuning_for_Audio_Deepfake_Detection
View on GitHub
☆13Nov 12, 2024Updated last year
qiuqiangkong / mini_music_tagging
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xieyuankun / VITS-chinese-finetune
View on GitHub
语音合成VITS 纯中文微调
☆12Mar 15, 2023Updated 3 years ago
microsoft / NoAudioCaptioning
View on GitHub
Repository for "Training Audio Captioning Models without Audio"
☆10Sep 26, 2023Updated 2 years ago
cai525 / Transformer4SED
View on GitHub
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
☆104Feb 10, 2026Updated 5 months ago
frankenliu / LOAE
View on GitHub
☆10Sep 25, 2024Updated last year
Heisenburger2020 / Multidimensional-Information-Assisted-Deep-Learning-Realizing-Flexible_Recognition_of_Vortex_Beam
View on GitHub
This project made use of both intensity and phase information to recognize orbital angular momentum mode.
☆36Jul 8, 2024Updated 2 years ago
lysanderism / TimeAudio
View on GitHub
The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…
☆30Nov 18, 2025Updated 8 months ago
xiquan-li / FineLAP
View on GitHub
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36Apr 20, 2026Updated 3 months ago
Ruiqi-Yan / URO-Bench
View on GitHub
Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models
☆55Sep 2, 2025Updated 10 months ago
ETH-DISCO / sao-instruct
View on GitHub
Official repo for SAO-Instruct: Free-form Audio Editing using Natural Language Instructions presented at NeurIPS 2025
☆18Oct 28, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Sreyan88 / GAMA
View on GitHub
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
☆153Dec 5, 2024Updated last year
SVDDChallenge / CtrSVDD_Utils
View on GitHub
☆18Jan 10, 2024Updated 2 years ago
onejiin / CycleGAN-VC2
View on GitHub
CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion
☆42Mar 2, 2020Updated 6 years ago
nttcslab / m2d
View on GitHub
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
☆162Feb 23, 2026Updated 5 months ago
Peiying-Yu / Table-Critic
View on GitHub
A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning.
☆21Aug 23, 2025Updated 11 months ago
qiuqiangkong / mini_llm
View on GitHub
☆29Jul 4, 2025Updated last year
AweAI-Team / ScaleSWE
View on GitHub
☆87Updated this week