Sta8is / DINO-ForesightLinks
Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆52Updated 3 months ago
Alternatives and similar repositories for DINO-Foresight
Users that are interested in DINO-Foresight are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers☆28Updated 2 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆43Updated last month
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆46Updated 3 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆66Updated 11 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆101Updated 2 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆68Updated last year
- ☆69Updated 2 months ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆150Updated 3 weeks ago
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).☆92Updated this week
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆79Updated 6 months ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆88Updated last year
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆311Updated 10 months ago
- ☆37Updated last year
- Official code for "DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut", NeurIPS 202…☆38Updated 4 months ago
- ☆20Updated 2 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆27Updated 3 months ago
- Diffusion Models as Data Mining Tools☆54Updated 3 weeks ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆20Updated 5 months ago
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).☆157Updated 2 weeks ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆162Updated last month
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆24Updated 8 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆277Updated 3 months ago
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023☆48Updated 11 months ago
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding☆45Updated last year
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels"☆38Updated last month
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆41Updated this week
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆115Updated this week
- Unifying 2D and 3D Vision-Language Understanding☆82Updated last month
- SceneFun3D ToolKit☆136Updated last month