KempnerInstitute / dinovision
☆23Updated 4 months ago
Alternatives and similar repositories for dinovision
Users who are interested in dinovision are comparing it to the repositories listed below.
- Clarity: A Minimalist Website Template for AI Research☆177Updated last year
- This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning mo…☆93Updated 3 months ago
- ☆110Updated last month
- ☆182Updated 2 months ago
- [NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO☆146Updated 2 months ago
- Unifying 2D and 3D Vision-Language Understanding☆119Updated 6 months ago
- Implementation of Danijar's latest iteration for his Dreamer line of work☆158Updated this week
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World"☆124Updated last month
- Code, data and weights for the paper **What drives success in physical planning with Joint-Embedding Predictive World Models?**☆111Updated 3 weeks ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆83Updated last week
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆54Updated 7 months ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆152Updated last month
- Code and data for "Does Spatial Cognition Emerge in Frontier Models?"☆27Updated 9 months ago
- ☆178Updated this week
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆72Updated 2 years ago
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆249Updated 2 weeks ago
- [NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting☆70Updated 3 weeks ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆125Updated 2 months ago
- TIPS (ICLR'25): Text-Image Pretraining with Spatial Awareness☆115Updated 9 months ago
- ☆122Updated 5 months ago
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆216Updated 2 months ago
- ☆49Updated 7 months ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields"☆18Updated 11 months ago
- This is the official code release for [LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors](https://arxiv…☆44Updated last year
- [ICML 2025] Implementation of Spatial Reasoning with Denoising Models☆86Updated 6 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆194Updated 7 months ago
- ☆35Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- Official implementation of DepthLM☆290Updated this week
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …☆68Updated 2 years ago