Sta8is / DINO-Foresight
Official Implementation of DINO-Foresight: Looking into the Future with DINO
☆43 · Updated last week
Alternatives and similar repositories for DINO-Foresight:
Users who are interested in DINO-Foresight are comparing it to the repositories listed below.
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers ☆57 · Updated 8 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ☆67 · Updated last year
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control ☆69 · Updated 2 months ago
- Implementation of Zero-Shot Video Semantic Segmentation ☆36 · Updated 6 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆74 · Updated 2 weeks ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long … ☆80 · Updated 9 months ago
- Diffusion Models as Data Mining Tools ☆53 · Updated 4 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models ☆291 · Updated 7 months ago
- ☆36 · Updated last year
- ☆58 · Updated last year
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆100 · Updated 9 months ago
- 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks. ☆39 · Updated last month
- ☆41 · Updated last month
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs ☆25 · Updated last month
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra… ☆93 · Updated 6 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆32 · Updated 8 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning… ☆25 · Updated 6 months ago
- [arXiv 2024] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields" ☆16 · Updated 2 weeks ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆59 · Updated 4 months ago
- Code release for "SegLLM: Multi-round Reasoning Segmentation" ☆68 · Updated this week
- SceneFun3D ToolKit ☆105 · Updated this week
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023 ☆48 · Updated 8 months ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation. ☆39 · Updated last month
- ☆15 · Updated 3 months ago
- ☆87 · Updated last month
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories ☆27 · Updated 4 months ago
- ☆73 · Updated last month
- ☆41 · Updated 3 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024 ☆36 · Updated 2 months ago
- Official code for "DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut", NeurIPS 202… ☆37 · Updated last month