wguo-ai/SSV2A

Gotta Hear Them All: Towards Sound Source Aware Audio Generation.

☆69

Alternatives and similar repositories for SSV2A

Users who are interested in SSV2A are comparing it to the libraries listed below.

sinberCS / switch2ai
switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…
☆173 · Nov 11, 2025 · Updated 8 months ago
gulucaptain / DynamiCtrl
[TMM'26] Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.
☆142 · May 23, 2025 · Updated last year
zhangyulin-space / ChatFerry
☆104 · Oct 8, 2025 · Updated 9 months ago
MarkLee131 / PoC-Research-Papers
Research papers on Proof-of-Concepts
☆114 · Feb 3, 2026 · Updated 5 months ago
Tanglumy / Finance-Bro
Your finance bro agent for trading and investing
☆111 · Nov 8, 2025 · Updated 8 months ago
ECNU-SII / Continual-NExT
☆235 · Jun 27, 2026 · Updated last month
DEFENSE-SEU / RobustFlow
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238 · Oct 19, 2025 · Updated 9 months ago
damo-cv / JCo-MVTON
☆124 · Aug 29, 2025 · Updated 10 months ago
ant-research / AvatarArtist
[CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.
☆280 · Jun 14, 2025 · Updated last year
serendipity800 / open-motion-apis
☆80 · Mar 5, 2026 · Updated 4 months ago
Victor20082018 / -Optimized-Aquatic-Target-Recognition-Model
The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…
☆48 · May 15, 2025 · Updated last year
Jinxhy / THEMIS
[USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
☆108 · Aug 13, 2025 · Updated 11 months ago
Tsinghua-dhy / UR2
UR2: Unify RAG and Reasoning through Reinforcement Learning
☆131 · May 26, 2026 · Updated 2 months ago
aoda-zhang / PawHaven-FullStack-React-NodeJS
🐱 PawHaven — an open-source platform that helps volunteers, shelters, and adopters report, track, and share stray animal rescue cases (f…
☆90 · Updated this week
yunbeizhang / Awesome-Visual-Prompt-Tuning
[TMLR] A curated list of awesome papers, resources, and tools for Visual Prompt Tuning (VPT).
☆115 · Feb 22, 2026 · Updated 5 months ago
liufanfanlff / C3-Context-Cascade-Compression
Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression
☆313 · Jan 27, 2026 · Updated 6 months ago
AIR-DISCOVER / FreeAskWorld
[AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…
☆229 · Jul 3, 2026 · Updated 3 weeks ago
MarkLee131 / Hypervisor-Testing-Survey
A collection of research papers on hypervisor testing.
☆65 · May 21, 2026 · Updated 2 months ago
tangpan360 / MicroRCA-Agent
2025 CCF International AIOps Challenge | Track 1: Microservice Root Cause Localization Based on Large Model Agents | "男团910" Solution · T…
☆259 · Jan 14, 2026 · Updated 6 months ago
EDAPINENUT / ExplicitShortCut
Official implementation of the paper "On the Design of One-Step Diffusion via Shortcutting Flow Paths"
☆287 · Apr 1, 2026 · Updated 3 months ago
jiaweizzhao / InRank
☆153 · Jan 2, 2024 · Updated 2 years ago
Harrydirk41 / ProTDyn
Generative Protein Emulator
☆69 · Sep 25, 2025 · Updated 10 months ago
bcmi / OSInsert-Image-Composition
☆62 · Jun 28, 2026 · Updated 3 weeks ago
Ga-Lee / Frequency-aware-Length-EXtension
Official implementation of the paper "Training-free Horizon Extension for Autoregressive Video Generation"
☆117 · Feb 17, 2026 · Updated 5 months ago
kand-ta / kand
Kand: Blazing-Fast, Modern Technical Analysis in Rust, Python, and WASM.
☆564 · Jan 22, 2026 · Updated 6 months ago
RLHFlow / Reinforce-Ada
[COLM 2026] An adaptive sampling framework for Reinforce-style LLM post training.
☆96 · Nov 29, 2025 · Updated 7 months ago
Ma-Zhuang / OmniNWM
[ECCV 2026] OmniNWM: Omniscient Navigation World Models for Autonomous Driving
☆364 · Jun 18, 2026 · Updated last month
HKUDS / LightReasoner
[ACL 2026 Oral] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
☆604 · May 22, 2026 · Updated 2 months ago
gwh22 / UniVoice
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115 · Oct 30, 2025 · Updated 8 months ago
guanhaisu / OBSD
Deciphering Oracle Bone Language with Diffusion Models (ACL 2024 Best Paper)
☆232 · Sep 17, 2025 · Updated 10 months ago
THUDM / INFTY
INFTY Engine: An Optimization Toolkit to Support Continual AI
☆573 · Jun 8, 2026 · Updated last month
SII-Hui / NI-Tex
[CVPR 2026 Highlight] The official repository of "NI-Tex: Non-isometric Image-based Garment Texture Generation".
☆71 · Apr 12, 2026 · Updated 3 months ago
YOUNG-bit / OpenGS-Fusion
[IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding
☆77 · Aug 2, 2025 · Updated 11 months ago
xivv123 / speed-ui-vue
A Vue 3 UI component library
☆26 · Oct 27, 2025 · Updated 9 months ago
bcmi / Object-Reflection-Generation-Dataset-DEROBA
The dataset, code, and model for our paper "Reflection Generation for Composite Image Using Diffusion Model", ICME, 2026.
☆58 · Apr 4, 2026 · Updated 3 months ago
BinaryFroggy / Hopet
A macOS desktop AI pet that mirrors your Claude Code & Codex CLI session state in real time. A pixel-art AI desktop pet supporting Claude Code and Codex CLI, with session state mirrored in real time, …
☆61 · Updated this week
WittF / bilibili-qr-login
🔳 A web tool for obtaining a Bilibili cookie by scanning a QR code
☆359 · Jun 3, 2026 · Updated last month
fengzeAltos / ROS2-Bag-Filter
A user-friendly ROS 2 bag filter with a graphical user interface (GUI) ✨
☆27 · May 7, 2025 · Updated last year
ByteDance-Seed / DAComp
[ICLR 2026] DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle
☆433 · Jul 10, 2026 · Updated 2 weeks ago