Sta8is / DINO-Foresight
Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆51Updated 2 months ago
Alternatives and similar repositories for DINO-Foresight
Users that are interested in DINO-Foresight are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers☆26Updated 2 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆46Updated 2 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆64Updated 11 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆67Updated last year
- ☆59Updated last month
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆88Updated 11 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆100Updated last month
- Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models [CVPR 2025]☆61Updated last month
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆307Updated 10 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆39Updated last week
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆77Updated 5 months ago
- Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆32Updated 2 weeks ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆37Updated 3 months ago
- ☆90Updated 4 months ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆120Updated this week
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆57Updated 2 months ago
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).☆85Updated 2 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆38Updated 5 months ago
- DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence (WACV 2025)☆22Updated 3 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆27Updated 2 months ago
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆26Updated 2 months ago
- The official implementation of Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion☆38Updated 2 months ago
- ☆37Updated last year
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Updated last month
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆91Updated 3 months ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆156Updated 2 weeks ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆118Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆75Updated last year
- ☆61Updated last year
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Updated 3 months ago