(AAAI 2025)MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
☆37May 21, 2025Updated 11 months ago
Alternatives and similar repositories for MUSES
Users that are interested in MUSES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Oct 17, 2024Updated last year
- UniVid: The Open-Source Unified Video Model☆32Oct 13, 2025Updated 6 months ago
- ☆146Feb 28, 2026Updated 2 months ago
- An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].☆14Jul 27, 2024Updated last year
- Omni Controllable Video Diffusion☆45Dec 22, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of "Robust Zero Level-Set Extraction from Unsigned Distance Fields Based on Double Covering"☆44Aug 6, 2024Updated last year
- ☆28Apr 25, 2025Updated last year
- Retargeting of whole-body human motion to humanoid robots for dexterous manipulation of articulated objects.☆30Jan 28, 2026Updated 3 months ago
- [ICLR 26] DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆41Aug 3, 2025Updated 8 months ago
- [AAAI 2026 Poster] TOSC: Task-Oriented Shape Completion for Open-World Dexterous Grasp Generation from Partial Point Clouds☆24Feb 2, 2026Updated 3 months ago
- [AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities☆15Apr 26, 2024Updated 2 years ago
- Speedy MASt3R repo☆16Sep 25, 2025Updated 7 months ago
- [SIGGRAPH 2025] Official Implementation of "Instant Self-Intersection Repair for 3D Meshes"☆47Mar 26, 2026Updated last month
- The official code of ’AdaFuse: Adaptive Medical Image Fusion Based on Spatial-Frequential Cross Attention‘.☆12Dec 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- EgoBody3M Egocentric Body Tracking on a VR Headset using a Diverse Dataset☆24Oct 1, 2024Updated last year
- Multi-Sensor Place Recognition with Visual and Text Semantics☆21May 27, 2025Updated 11 months ago
- A list of works on video generation towards world model☆469Mar 21, 2026Updated last month
- [ICLR 2024] Neural Processing of Tri-Plane Hybrid Neural Fields☆15Feb 21, 2026Updated 2 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…☆90Jun 26, 2025Updated 10 months ago
- ☆15Jan 31, 2019Updated 7 years ago
- [NN 2024] Code Release of Unsupervised Distribution-aware Keypoints Generation from 3D Point Clouds☆11Feb 20, 2024Updated 2 years ago
- OneWorld: Taming Scene Generation with 3D Unified Representation Autoencoder☆58Mar 24, 2026Updated last month
- Mesh generation from sparse matrices☆23Nov 5, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆65Oct 15, 2024Updated last year
- ☆34Jan 27, 2026Updated 3 months ago
- ☆43Mar 9, 2026Updated last month
- ☆22Jun 20, 2025Updated 10 months ago
- A Powerful LoRA key converter for ComfyUI☆28Nov 17, 2025Updated 5 months ago
- Re-implementation of VertexRegen [ICCV 25]☆40Jan 25, 2026Updated 3 months ago
- Using Kolmogorov Arnold Networks (KANs) instead of MLPs in PointNet for Classification and Segmentation of 3D Point Sets☆16Apr 23, 2026Updated last week
- [CVPR25] SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs☆18Aug 27, 2025Updated 8 months ago
- Code for "Cross-modal Learning for Image-Guided Point Cloud Shape Completion" (NeurIPS 2022)☆67Jun 17, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A modern, interactive template for scientific writing☆29Apr 10, 2026Updated 3 weeks ago
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆81Feb 26, 2026Updated 2 months ago
- A globally sparse but locally dense 3D feature renderer for camera relocalization.☆13Apr 16, 2025Updated last year
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆45Jan 27, 2026Updated 3 months ago
- [ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation☆123Aug 24, 2025Updated 8 months ago
- Paper_Copilot 是一款基于向量索引和大模型的高级文献分析命令行工具,旨在帮助学术研究人员高效管理、检索和分析海量文献。通过本地自建知识库并与大模型的交互,它能够为用户提供专业且精准的解答,显著提升文献研究的效率与准确性。☆25Oct 15, 2024Updated last year
- AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models☆139Jan 6, 2026Updated 3 months ago