Sta8is / DINO-Foresight
Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆54Updated 4 months ago
Alternatives and similar repositories for DINO-Foresight
Users who are interested in DINO-Foresight are comparing it to the repositories listed below.
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆54Updated last week
- [CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers☆29Updated 3 months ago
- Official code for "JAFAR: Jack up Any Feature at Any Resolution"☆124Updated last week
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆49Updated 4 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆80Updated 7 months ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆36Updated this week
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).☆94Updated 2 weeks ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆68Updated last year
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆38Updated 6 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆66Updated last year
- The official implementation of Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion☆40Updated 4 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆48Updated 3 weeks ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆78Updated last year
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆314Updated 11 months ago
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels"☆40Updated 2 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆102Updated 3 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆89Updated last year
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023☆48Updated last year
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆164Updated 2 months ago
- Public repository for the ECCV 2024 paper "Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation".☆25Updated 8 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆282Updated 4 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆156Updated 2 weeks ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆120Updated last year
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆71Updated last year