Sta8is / FUTURISTLinks
[CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
☆29Updated 3 months ago
Alternatives and similar repositories for FUTURIST
Users that are interested in FUTURIST are comparing it to the libraries listed below
Sorting:
- Official Implementation of DINO-Foresight: Looking into the Future with DINO☆54Updated 4 months ago
- FLOSS: Plug-in Training-free and label-free text template selection that boosts OVSS methods☆21Updated 2 months ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆49Updated this week
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆29Updated 11 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆30Updated 2 weeks ago
- ☆83Updated 2 months ago
- ☆53Updated last year
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆17Updated 3 weeks ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆80Updated 7 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆57Updated 3 months ago
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆58Updated 2 weeks ago
- ☆101Updated 7 months ago
- Official code for "JAFAR: Jack up Any Feature at Any Resolution"☆124Updated last week
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆125Updated 2 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆34Updated 5 months ago
- ☆37Updated 11 months ago
- Official Github Repo for GEM☆65Updated last week
- ☆38Updated 11 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆92Updated 3 weeks ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆33Updated 3 months ago
- Downstream semantic segmentation evaluation of DGInStyle.☆25Updated last year
- ☆87Updated 5 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆156Updated 2 weeks ago
- PyTorch code and models for ScaLR image-to-lidar distillation method☆52Updated 11 months ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆81Updated 6 months ago
- [AAAI 2024-Oral] EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder☆31Updated last year
- [ECCV 2024] The official PyTorch implementation of the "Part2Object: Hierarchical Unsupervised 3D Instance Segmentation".☆22Updated 9 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆120Updated last year
- ☆25Updated 2 weeks ago