DINGYANB/MUSES

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DINGYANB/MUSES)

DINGYANB / MUSES

（AAAI 2025）MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration

☆37

Alternatives and similar repositories for MUSES

Users that are interested in MUSES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

markywg / transagent
View on GitHub
[NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
☆25Oct 17, 2024Updated last year
zhuangshaobin / Video-GPT
View on GitHub
[ICLR2026] Video-GPT via Next Clip Diffusion.
☆46Jun 2, 2025Updated last year
Tele-AI / OmniVDiff
View on GitHub
Omni Controllable Video Diffusion
☆46Dec 22, 2025Updated 7 months ago
gfmei / GeoZe
View on GitHub
☆29Updated this week
amazon-science / PIXELS
View on GitHub
Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"
☆11Dec 17, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
arthurchen0518 / FoundHand
View on GitHub
☆17Jun 29, 2025Updated last year
zhangbw17 / MV-Adapter
View on GitHub
An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].
☆14Jul 27, 2024Updated last year
SAIS-FUXI / Omni-Video
View on GitHub
☆157Feb 28, 2026Updated 4 months ago
vivoCameraResearch / Hyper-Motion
View on GitHub
HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.
☆154May 27, 2026Updated last month
tanABCC / VABench
View on GitHub
☆16Jul 8, 2026Updated 2 weeks ago
join16 / COD-VAE
View on GitHub
Representing 3D Shapes with 64 Latent Vectors for 3D Diffusion Models
☆26Sep 15, 2025Updated 10 months ago
microsoft / AVGen-Bench
View on GitHub
[ICML26] AVGen-Bench is a task-driven benchmark for multi-granular evaluation of Text-to-Audio-Video (T2AV) generation.
☆22Jul 2, 2026Updated 3 weeks ago
hzphzp / WeGen
View on GitHub
☆27Apr 25, 2025Updated last year
THUKElab / MESED
View on GitHub
[AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities
☆15Apr 26, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
xianming-gu / AdaFuse
View on GitHub
The official code of ’AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention‘.
☆12Dec 11, 2024Updated last year
baaivision / MTVCraft
View on GitHub
MTVCraft: An Open Veo3-style Audio-Video Generation Demo
☆98Oct 8, 2025Updated 9 months ago
alexmelekhin / MSSPlace
View on GitHub
Multi-Sensor Place Recognition with Visual and Text Semantics
☆23May 27, 2025Updated last year
liuxiaoyu1104 / AnimateAnywhere
View on GitHub
[TMM 2026] Rouse the Background in Human Image Animation
☆30Apr 24, 2025Updated last year
OPPO-Mente-Lab / X2I
View on GitHub
Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…
☆89Jun 26, 2025Updated last year
Chenguoz / Keypoints
View on GitHub
[NN 2024] Code Release of Unsupervised Distribution-aware Keypoints Generation from 3D Point Clouds
☆11Feb 20, 2024Updated 2 years ago
CVLAB-Unibo / triplane_processing
View on GitHub
[ICLR 2024] Neural Processing of Tri-Plane Hybrid Neural Fields
☆15Feb 21, 2026Updated 5 months ago
jerry4h / Face_Xray
View on GitHub
A 3rd-party implemented Face-Xray for deepfake detection.
☆13Jun 2, 2020Updated 6 years ago
Bob-cheng / CL-FusionAttack
View on GitHub
The Pytorch implementation for the paper "Fusion is Not Enough: Single Modal Attack on Fusion Models for 3D Object Detection"
☆20Mar 9, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
QinRui-k / GCN-CIS
View on GitHub
☆14May 27, 2024Updated 2 years ago
DiT-3D / FastDiT-3D
View on GitHub
☆14Mar 23, 2024Updated 2 years ago
tudo-seal / CLS-CAD
View on GitHub
Automated CAD assembly generation based on Combinatory Logic Synthesis.
☆15Jul 7, 2026Updated 2 weeks ago
Zheng222 / VideoHDR
View on GitHub
☆17Jun 17, 2020Updated 6 years ago
QinRui-k / MVC-Net
View on GitHub
☆16Sep 26, 2024Updated last year
Ali-Stanford / PointNet_KAN_Graphic
View on GitHub
Using Kolmogorov Arnold Networks (KANs) instead of MLPs in PointNet for Classification and Segmentation of 3D Point Sets
☆15Apr 23, 2026Updated 3 months ago
zx1239856 / VertexRegen
View on GitHub
Re-implementation of VertexRegen [ICCV 25]
☆41Jan 25, 2026Updated 6 months ago
SpatiaOS / P3D-Bench
View on GitHub
Benchmarking MLLMs for Parametric 3D Generation and Structural Reasoning (Text-to-3D, Image-to-3D, Assembly-3D)
☆46Updated this week
jjjkkyz / DCUDF
View on GitHub
Implementation of "Robust Zero Level-Set Extraction from Unsigned Distance Fields Based on Double Covering"
☆44Jun 3, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Yanbo-23 / Proto-Comp
View on GitHub
☆19Nov 18, 2024Updated last year
eliphatfs / cumesh2sdf
View on GitHub
Mesh to SDF implemented with CUDA.
☆49Aug 2, 2024Updated last year
1zb / functional-diffusion
View on GitHub
☆67Oct 15, 2024Updated last year
zhangguiwei610 / V2Flow
View on GitHub
☆29Mar 30, 2025Updated last year
AIGeeksGroup / UniVid
View on GitHub
UniVid: The Open-Source Unified Video Model
☆32Oct 13, 2025Updated 9 months ago
VisionXLab / AdapTok
View on GitHub
[CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space
☆29Mar 15, 2026Updated 4 months ago
yaorong0921 / DynStatF
View on GitHub
This repo includes code for the paper "DynStatF: An Efficient Feature Fusion Strategy for LiDAR 3D Object Detection"
☆24Dec 20, 2023Updated 2 years ago