Sta8is / FUTURISTLinks
[CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
☆28Updated 2 months ago
Alternatives and similar repositories for FUTURIST
Users that are interested in FUTURIST are comparing it to the libraries listed below
Sorting:
- Official Implementation of DINO-Foresight: Looking into the Future with DINO☆52Updated 3 months ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆150Updated 3 weeks ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆79Updated 6 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆56Updated 3 months ago
- Downstream semantic segmentation evaluation of DGInStyle.☆25Updated last year
- ☆53Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆46Updated 4 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆119Updated last year
- Official Github Repo for GEM☆60Updated last month
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆29Updated 10 months ago
- ☆69Updated 2 months ago
- Code for "Open Vocabulary Monocular 3D Object Detection"☆49Updated last month
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆86Updated last week
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).☆92Updated this week
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆146Updated 2 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆66Updated 11 months ago
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆125Updated 2 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight)☆43Updated last month
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆101Updated 2 months ago
- FLOSS: Plug-in Training-free and label-free text template selection that boosts OVSS methods☆21Updated last month
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆28Updated 9 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆105Updated 4 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆58Updated 8 months ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆77Updated last year
- ☆101Updated 6 months ago
- SceneFun3D ToolKit☆136Updated last month
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆34Updated 4 months ago
- ☆38Updated 10 months ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024☆58Updated last year