Sta8is / FUTURIST
[CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
☆26Updated 2 months ago
Alternatives and similar repositories for FUTURIST
Users that are interested in FUTURIST are comparing it to the libraries listed below
Sorting:
- Official Implementation of DINO-Foresight: Looking into the Future with DINO☆51Updated 2 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆55Updated 2 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆64Updated 11 months ago
- Official Github Repo for GEM☆52Updated 2 weeks ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆120Updated this week
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆100Updated last month
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆28Updated 9 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆118Updated last year
- PyTorch code and models for ScaLR image-to-lidar distillation method☆50Updated 10 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆77Updated 5 months ago
- ☆99Updated 5 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆91Updated 3 months ago
- FLOSS: Plug-in Training-free and label-free text template selection that boosts OVSS methods☆20Updated 2 weeks ago
- SceneFun3D ToolKit☆135Updated 3 weeks ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆29Updated 9 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆28Updated 8 months ago
- Downstream semantic segmentation evaluation of DGInStyle.☆25Updated last year
- ☆59Updated last month
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆88Updated 11 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆39Updated last week
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆21Updated last week
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆31Updated 2 months ago
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).☆85Updated 2 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆58Updated 2 months ago
- ☆84Updated 4 months ago
- Official implementation of "Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive" (ICLR 2024)☆54Updated 8 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆32Updated 3 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆78Updated last month
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 5 months ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆43Updated 3 months ago