Sta8is / FUTURIST
[CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
☆35 · Updated last week
Alternatives and similar repositories for FUTURIST
Users who are interested in FUTURIST are comparing it to the repositories listed below.
- Official GitHub Repo for GEM · ☆83 · Updated last week
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model · ☆83 · Updated 8 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding · ☆162 · Updated last month
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar… · ☆58 · Updated 5 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation · ☆109 · Updated 6 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes · ☆127 · Updated 6 months ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving · ☆41 · Updated 6 months ago
- ☆113 · Updated 7 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding · ☆57 · Updated 5 months ago
- GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving · ☆22 · Updated 5 months ago
- ☆103 · Updated 9 months ago
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding · ☆56 · Updated 7 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024) · ☆81 · Updated 3 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression · ☆46 · Updated 11 months ago
- [NeurIPS 2024] TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight · ☆32 · Updated last month
- PyTorch code and models for ScaLR image-to-lidar distillation method · ☆55 · Updated last month
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images" · ☆107 · Updated 7 months ago
- ☆38 · Updated last year
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection" · ☆14 · Updated last year
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model · ☆73 · Updated 8 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model · ☆99 · Updated 7 months ago
- ☆52 · Updated 8 months ago
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25) · ☆50 · Updated 6 months ago
- [NeurIPS 2024] DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features · ☆33 · Updated 8 months ago
- Official Implementation of DINO-Foresight: Looking into the Future with DINO · ☆60 · Updated last week
- [ICCV 2025] GaussRender: Learning 3D Occupancy with Gaussian Rendering (official repository) · ☆52 · Updated last month
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase · ☆75 · Updated last year
- FLOSS: Plug-in Training-free and label-free text template selection that boosts OVSS methods (ICCV 2025) · ☆25 · Updated last month
- ☆27 · Updated 11 months ago
- This is the official project repository for "DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diff… · ☆27 · Updated 3 months ago