(AAAI 2025)MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
☆41May 21, 2025Updated 10 months ago
Alternatives and similar repositories for MUSES
Users that are interested in MUSES are comparing it to the libraries listed below
Sorting:
- Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.☆278Feb 3, 2026Updated last month
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Oct 17, 2024Updated last year
- [ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction☆63Sep 3, 2025Updated 6 months ago
- ☆130Feb 28, 2026Updated 3 weeks ago
- Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"☆11Dec 17, 2024Updated last year
- LHM++: An Efficient Large Human Reconstruction Model for Pose-free Images to 3D☆70Mar 16, 2026Updated last week
- Implementation of "Robust Zero Level-Set Extraction from Unsigned Distance Fields Based on Double Covering"☆42Aug 6, 2024Updated last year
- Omni Controllable Video Diffusion☆42Dec 22, 2025Updated 3 months ago
- An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].☆14Jul 27, 2024Updated last year
- [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space☆24Mar 15, 2026Updated last week
- ☆28Dec 17, 2025Updated 3 months ago
- ☆27Apr 25, 2025Updated 10 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- [AAAI 2026 Poster] TOSC: Task-Oriented Shape Completion for Open-World Dexterous Grasp Generation from Partial Point Clouds☆21Feb 2, 2026Updated last month
- Speedy MASt3R repo☆15Sep 25, 2025Updated 5 months ago
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …☆649Feb 27, 2026Updated 3 weeks ago
- ☆30Apr 24, 2025Updated 10 months ago
- EgoBody3M Egocentric Body Tracking on a VR Headset using a Diverse Dataset☆22Oct 1, 2024Updated last year
- The official code of ’AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention‘.☆12Dec 11, 2024Updated last year
- A list of works on video generation towards world model☆433Updated this week
- Multi-Sensor Place Recognition with Visual and Text Semantics☆21May 27, 2025Updated 9 months ago
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆40Jan 27, 2026Updated last month
- ☆14Jan 31, 2019Updated 7 years ago
- Dataset and codes will be released soon.☆15Oct 26, 2023Updated 2 years ago
- Mesh generation from sparse matrices☆23Nov 5, 2025Updated 4 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…☆90Jun 26, 2025Updated 8 months ago
- A 3rd-party implemented Face-Xray for deepfake detection.☆13Jun 2, 2020Updated 5 years ago
- Code of Dispel Darkness for Better Fusion: A Controllable Visual Enhancer based on Cross-modal Conditional Adversarial Learning.☆19Jun 24, 2024Updated last year
- ☆65Oct 15, 2024Updated last year
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching☆20Apr 21, 2025Updated 11 months ago
- A Powerful LoRA key converter for ComfyUI☆28Nov 17, 2025Updated 4 months ago
- ☆14Mar 23, 2024Updated 2 years ago
- ☆21Jun 20, 2025Updated 9 months ago
- Repository for "Echoes of the Coliseum: Towards 3D Live streaming of Sports Events"☆27Sep 4, 2025Updated 6 months ago
- ☆17Jun 17, 2020Updated 5 years ago
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆77Feb 26, 2026Updated 3 weeks ago
- ☆14May 27, 2024Updated last year
- [ICLR2026] Video-GPT via Next Clip Diffusion.☆44Jun 2, 2025Updated 9 months ago
- [CVPR25] SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs☆19Aug 27, 2025Updated 6 months ago