Sta8is / DINO-ForesightLinks
[NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆117Updated last month
Alternatives and similar repositories for DINO-Foresight
Users that are interested in DINO-Foresight are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆182Updated last month
- Official implementation of DepthLM☆229Updated 3 weeks ago
- Unifying 2D and 3D Vision-Language Understanding☆115Updated 3 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆72Updated last month
- ☆109Updated 2 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆54Updated 8 months ago
- ☆24Updated 7 months ago
- ☆45Updated 4 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆67Updated last year
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆103Updated 7 months ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆186Updated 6 months ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆79Updated last year
- ☆41Updated last year
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆115Updated 7 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆95Updated 8 months ago
- SceneFun3D ToolKit☆157Updated 6 months ago
- [CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers☆39Updated 2 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆122Updated last year
- Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (ICCV 2025)☆68Updated last month
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆215Updated 3 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆300Updated last month
- Generative World Explorer☆158Updated 4 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding☆346Updated last month
- [ICCV'25 Oral] The official implementation of Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion☆53Updated 3 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆329Updated last year
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆296Updated 2 weeks ago
- Code repository for "DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers"☆59Updated 4 months ago
- ☆35Updated 5 months ago
- ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding☆14Updated 2 months ago
- [NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS☆130Updated 2 weeks ago