ZeyueT/AudioX

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZeyueT/AudioX)

ZeyueT / AudioX

[ICLR 2026] Repository of AudioX

☆1,544

Alternatives and similar repositories for AudioX

Users that are interested in AudioX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sinberCS / switch2ai
View on GitHub
switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…
☆173Nov 11, 2025Updated 8 months ago
zhangyulin-space / ChatFerry
View on GitHub
☆104Oct 8, 2025Updated 9 months ago
ZeyueT / Audio-Omni
View on GitHub
[SIGGRAPH 2026] Repository of Audio-Omni
☆398Jun 10, 2026Updated last month
AlenjandroWang / ASVR
View on GitHub
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
☆190Apr 7, 2026Updated 3 months ago
MarkLee131 / PoC-Research-Papers
View on GitHub
Research papers on Proot-of-Concepts
☆114Feb 3, 2026Updated 5 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ant-research / AvatarArtist
View on GitHub
[CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.
☆280Jun 14, 2025Updated last year
Tanglumy / Finance-Bro
View on GitHub
your finance bro Agent for trading and investing
☆111Nov 8, 2025Updated 8 months ago
Jinxhy / THEMIS
View on GitHub
[USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
☆108Aug 13, 2025Updated 11 months ago
DEFENSE-SEU / RobustFlow
View on GitHub
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238Oct 19, 2025Updated 9 months ago
ECNU-SII / Continual-NExT
View on GitHub
☆235Jun 27, 2026Updated 3 weeks ago
gulucaptain / DynamiCtrl
View on GitHub
[TMM'26] Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.
☆142May 23, 2025Updated last year
serendipity800 / open-motion-apis
View on GitHub
☆80Mar 5, 2026Updated 4 months ago
XindaLi304 / TACOformer
View on GitHub
A neural network for emotion recognition based on multimodal physiological signal
☆81Feb 23, 2026Updated 5 months ago
aoda-zhang / PawHaven-FullStack-React-NodeJS
View on GitHub
🐱 PawHaven — an open-source platform that helps volunteers, shelters, and adopters report, track, and share stray animal rescue cases (f…
☆90Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
AIR-DISCOVER / FreeAskWorld
View on GitHub
[AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…
☆228Jul 3, 2026Updated 3 weeks ago
yunbeizhang / Awesome-Visual-Prompt-Tuning
View on GitHub
[TMLR] A curated list of awesome papers, resources, and tools for Visual Prompt Tuning (VPT).
☆115Feb 22, 2026Updated 5 months ago
liufanfanlff / C3-Context-Cascade-Compression
View on GitHub
Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression
☆313Jan 27, 2026Updated 5 months ago
Tsinghua-dhy / UR2
View on GitHub
UR2: Unify RAG and Reasoning through Reinforcement Learning
☆131May 26, 2026Updated last month
TIML-Group / Conformal-Prediction-Unlearning
View on GitHub
☆61May 1, 2026Updated 2 months ago
QwenAudio / ThinkSound
View on GitHub
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Tho…
☆1,372Apr 3, 2026Updated 3 months ago
bcmi / Object-Reflection-Generation-Dataset-DEROBA
View on GitHub
The dataset, code, and model for our paper "Reflection Generation for Composite Image Using Diffusion Model", ICME, 2026.
☆58Apr 4, 2026Updated 3 months ago
THUDM / INFTY
View on GitHub
INFTY Engine: An Optimization Toolkit to Support Continual AI
☆573Jun 8, 2026Updated last month
TIML-Group / Mode-Connectivity-Unlearning
View on GitHub
☆53May 1, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HKUDS / LightReasoner
View on GitHub
[ACL 2026 Oral] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
☆603May 22, 2026Updated 2 months ago
damo-cv / JCo-MVTON
View on GitHub
☆124Aug 29, 2025Updated 10 months ago
EDAPINENUT / ExplicitShortCut
View on GitHub
Official implementation of the paper <On the Design of One-Step Diffusion via Shortcutting Flow Paths>
☆286Apr 1, 2026Updated 3 months ago
Victor20082018 / -Optimized-Aquatic-Target-Recognition-Model
View on GitHub
The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…
☆48May 15, 2025Updated last year
tangpan360 / MicroRCA-Agent
View on GitHub
2025 CCF International AIOps Challenge | Track 1: Microservice Root Cause Localization Based on Large Model Agents | "男团910" Solution · T…
☆256Jan 14, 2026Updated 6 months ago
MarkLee131 / Hypervisor-Testing-Survey
View on GitHub
A collection of research papers on hypervisor testing.
☆65May 21, 2026Updated 2 months ago
nanxiang11 / CodeLab_LLM
View on GitHub
🌟 从LLaMA2开启大语言模型原理与实践教程
☆76Oct 29, 2025Updated 8 months ago
gwh22 / UniVoice
View on GitHub
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115Oct 30, 2025Updated 8 months ago
TIML-Group / Robust-MoE-Dual-Model
View on GitHub
☆42Aug 17, 2025Updated 11 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Harrydirk41 / ProTDyn
View on GitHub
Generative Protein Emulator
☆69Sep 25, 2025Updated 9 months ago
bcmi / OSInsert-Image-Composition
View on GitHub
☆62Jun 28, 2026Updated 3 weeks ago
kand-ta / kand
View on GitHub
Kand: Blazing-Fast, Modern Technical Analysis in Rust, Python, and WASM.
☆564Jan 22, 2026Updated 6 months ago
zzhuang94 / gin-vue-web
View on GitHub
一个基于 Gin 和 Vue 的企业级全栈 Web 开发框架，专为快速构建现代化管理平台而生。采用前后端分离架构，通过约定优于配置的设计理念，将传统 CRUD 开发效率提升 10 倍以上。 A minimal MVC web framework built with Gin…
☆31May 27, 2026Updated last month
wguo-ai / SSV2A
View on GitHub
Gotta Hear Them All: Towards Sound Source Aware Audio Generation.
☆69Nov 15, 2025Updated 8 months ago
bird-bench / BIRD-Interact
View on GitHub
[ICLR 2026 Oral] BIRD-INTERACT: Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.
☆1,009Mar 29, 2026Updated 3 months ago
DataArcTech / DataArc-SynData-Toolkit
View on GitHub
Synthetic Data Generation Platform By DataArcTech
☆1,762Jun 30, 2026Updated 3 weeks ago