littlepure2333 / GFlowLinks
[AAAI 2025] GFlow: Recovering 4D World from Monocular Video
☆46Updated 2 months ago
Alternatives and similar repositories for GFlow
Users that are interested in GFlow are comparing it to the libraries listed below
Sorting:
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆67Updated last month
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆17Updated last month
- Self-reimplemented version of 4D-LRM.☆47Updated last month
- Official Code of IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆33Updated 2 weeks ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆86Updated 3 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆81Updated 2 weeks ago
- ☆34Updated last year
- DreamGaussian with 2D-GS☆12Updated 9 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆43Updated 2 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆135Updated last month
- [CVPR 2025] UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting☆36Updated 2 weeks ago
- ☆54Updated 4 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆36Updated 5 months ago
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆93Updated 2 months ago
- ☆51Updated last month
- ☆55Updated this week
- [ACM MM 2025] EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler☆17Updated 3 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆20Updated 4 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆53Updated last week
- ☆94Updated 3 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".☆44Updated 3 weeks ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆214Updated 2 weeks ago
- Seeing World Dynamics in a Nutshell☆109Updated 4 months ago
- ☆22Updated 3 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆95Updated 4 months ago
- [ICLR 25'] InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting☆20Updated 3 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆63Updated 2 months ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆79Updated last year
- Official code for "JAFAR: Jack up Any Feature at Any Resolution"☆142Updated last week
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆31Updated last month