littlepure2333 / GFlowLinks
[AAAI 2025] GFlow: Recovering 4D World from Monocular Video
☆43Updated last month
Alternatives and similar repositories for GFlow
Users that are interested in GFlow are comparing it to the libraries listed below
Sorting:
- Self-reimplemented version of 4D-LRM.☆30Updated 3 weeks ago
- [ICLR 25'] InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting☆20Updated 2 months ago
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆58Updated 2 weeks ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆179Updated this week
- ☆34Updated last year
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆92Updated 3 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆36Updated 4 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆17Updated 3 weeks ago
- ☆38Updated 11 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆42Updated last month
- DreamGaussian with 2D-GS☆12Updated 8 months ago
- [ICLR 2024] This is the official implementation of our paper "Semantic Flow: Learning Semantic Fields of Dynamic Scenes from Monocular Vi…☆11Updated 9 months ago
- ☆54Updated 3 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆34Updated 3 weeks ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆61Updated 2 months ago
- [CVPR 2024 Highlight] GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding☆27Updated 11 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆83Updated 2 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated 9 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆48Updated 3 weeks ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆123Updated last month
- (CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers☆41Updated 9 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆45Updated last week
- [CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆30Updated 2 months ago
- ☆47Updated last month
- ☆16Updated 2 months ago
- ☆23Updated last month
- Official implementation of "D^2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes"☆51Updated 2 months ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆63Updated 2 weeks ago
- ☆21Updated 3 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year