Sta8is / FUTURISTLinks
[CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
☆39Updated 3 months ago
Alternatives and similar repositories for FUTURIST
Users that are interested in FUTURIST are comparing it to the libraries listed below
Sorting:
- Official Github Repo for GEM☆95Updated last month
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆129Updated 8 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆184Updated 2 months ago
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar…☆61Updated 8 months ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆48Updated 9 months ago
- ☆127Updated 10 months ago
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆84Updated 11 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆111Updated 9 months ago
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆63Updated 10 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆53Updated 10 months ago
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase☆75Updated last year
- This is the official project repository for "DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diff…☆32Updated 2 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆79Updated 11 months ago
- GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving☆25Updated 8 months ago
- [ICRA 2025] Official implementation for "TrackOcc: Camera-based 4D Panoptic Occupancy Tracking"☆51Updated 4 months ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024☆63Updated last year
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"☆112Updated 10 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆73Updated last month
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆64Updated last year
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆35Updated 10 months ago
- ☆102Updated last year
- ☆44Updated 3 weeks ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆60Updated 8 months ago
- ☆18Updated 7 months ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆248Updated last year
- [ICCV 2025] GaussRender: Learning 3D Occupancy with Gaussian Rendering (official repository)☆62Updated 4 months ago
- [NeurIPS 2024] TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight☆33Updated 4 months ago
- [3DV 2026] Open Vocabulary Monocular 3D Object Detection☆63Updated 6 months ago
- ☆38Updated last year
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆49Updated last year