SkyworkAI / MVGamba
[NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
☆57 · Updated 4 months ago
Alternatives and similar repositories for MVGamba:
Users who are interested in MVGamba are comparing it to the repositories listed below.
- The official PyTorch implementation of Diffusion Time-step Curriculum for One Image to 3D Generation (CVPR 2024) ☆76 · Updated 10 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo) ☆79 · Updated last month
- Official repository of MMGenBench ☆119 · Updated last month
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion ☆118 · Updated 5 months ago
- Gotta Hear Them All: Sound Source Aware Vision to Audio Generation. ☆60 · Updated last month
- Efficient controlnet for DiTs ☆170 · Updated this week
- Official implementation of paper "Multi-Level Collaboration in Model Merging" ☆40 · Updated 2 weeks ago
- 📌 [Arxiv2025] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆163 · Updated 2 weeks ago
- ☆153 · Updated last year
- The repository for 'Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid' ☆138 · Updated 2 months ago
- ☆160 · Updated 6 months ago
- [ICLR 2025] Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling ☆75 · Updated last month
- Self-Supervised Pre-training for 3D Point Clouds via View-Specific Point-to-Image Translation ☆20 · Updated 4 months ago
- Wan2.1 with Controlnet ☆158 · Updated 3 weeks ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which … ☆498 · Updated last week
- ☆69 · Updated last month
- Efficient DiT architecture for text2any tasks, ICLR2025 ☆421 · Updated 2 months ago
- ☆27 · Updated 5 months ago
- Text-to-3D Generation by 2D Editing ☆65 · Updated last month
- Official PyTorch Implementation of Habitizing Diffusion Planning for Efficient and Effective Decision Making ☆28 · Updated 2 months ago
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy ☆64 · Updated 2 months ago
- ☆244 · Updated 3 months ago
- Stereo Any Video: Temporally Consistent Stereo Matching ☆127 · Updated 3 weeks ago
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati… ☆111 · Updated 3 months ago
- hybrid sfm with VIO Pose, RGB and depth data ☆52 · Updated last year
- [🔨software] Sketch-based tree modeling software, implemented in C++ and OpenGL. ☆109 · Updated last month
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness ☆48 · Updated 8 months ago
- ☆207 · Updated last week
- Run JavaScript code from Python. ☆101 · Updated last month
- RegGeoNet: Learning Regular Representations for Large-Scale 3D Point Clouds ☆36 · Updated 4 months ago