KempnerInstitute / dinovisionLinks
☆25Updated 4 months ago
Alternatives and similar repositories for dinovision
Users that are interested in dinovision are comparing it to the libraries listed below
Sorting:
- Clarity: A Minimalist Website Template for AI Research☆183Updated last year
- [NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO☆147Updated 2 months ago
- ☆114Updated last week
- Implementation of Danijar's latest iteration for his Dreamer line of work☆161Updated 2 weeks ago
- This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning mo…☆95Updated 3 months ago
- ☆186Updated 2 months ago
- Unifying 2D and 3D Vision-Language Understanding☆121Updated 6 months ago
- [NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting☆70Updated last month
- World Modeling by Forecasting Vision Foundation Model Features☆32Updated last month
- An open source library designed to provide community examples of Joint Embedding Predictive Architectures (JEPAs). It contains code and e…☆232Updated last week
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World"☆124Updated last month
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation☆56Updated 4 months ago
- Block-Recurrent Dynamics in ViTs 🦖☆24Updated last month
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆83Updated 3 weeks ago
- ☆170Updated 3 months ago
- ☆124Updated 5 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆58Updated 11 months ago
- Code and data for "Does Spatial Cognition Emerge in Frontier Models?"☆27Updated 9 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆104Updated 10 months ago
- Official implementation of ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis☆36Updated 8 months ago
- [ICML 2025] Implementation of Spatial Reasoning with Denoising Models☆86Updated 6 months ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆151Updated last month
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆56Updated 7 months ago
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆216Updated 2 months ago
- ☆49Updated 7 months ago
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆218Updated 11 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆126Updated 3 months ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields"☆18Updated last year
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆249Updated 3 weeks ago
- Public release of the code for "Accelerating Vision Transformers with Adaptive Patches"☆90Updated 3 months ago