Sta8is / DINO-ForesightLinks
Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆59Updated 5 months ago
Alternatives and similar repositories for DINO-Foresight
Users that are interested in DINO-Foresight are comparing it to the libraries listed below
Sorting:
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆52Updated 5 months ago
- Official code for "JAFAR: Jack up Any Feature at Any Resolution"☆147Updated 3 weeks ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆68Updated last year
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆63Updated last month
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆111Updated 4 months ago
- ☆95Updated 4 months ago
- [CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers☆34Updated last month
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆82Updated 8 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆314Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆68Updated last year
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆54Updated last month
- Generative World Explorer☆150Updated last month
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆103Updated 4 months ago
- Segment This Thing is an efficient image segmentation models that uses a biologically-inspired foveated tokenization to reduce inference …☆46Updated last month
- Unifying 2D and 3D Vision-Language Understanding☆98Updated 2 weeks ago
- ☆43Updated last month
- Visualization of the PCA as shown in Figure 1.☆33Updated last year
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆168Updated 3 months ago
- ☆29Updated 2 months ago
- Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆70Updated last week
- ☆23Updated 4 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆286Updated 5 months ago
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆178Updated 2 weeks ago
- LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS☆91Updated 3 weeks ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆95Updated 6 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆120Updated last year
- ☆102Updated 4 months ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆79Updated last year
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆48Updated 3 months ago
- Program synthesis for 3D spatial reasoning☆44Updated last month