tsinghua-fib-lab / UrbanLLaVA
[ICCV 2025] UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding.
☆68 Oct 6, 2025 Updated 4 months ago
Alternatives and similar repositories for UrbanLLaVA
Users who are interested in UrbanLLaVA are comparing it to the repositories listed below
- [KDD 2025 Research] CityGPT: Empowering Urban Spatial Cognition of Large Language Models. ☆48 Jul 15, 2025 Updated 7 months ago
- [KDD 2025 D&B] CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks. ☆47 Jul 15, 2025 Updated 7 months ago
- ☆46 Oct 2, 2025 Updated 4 months ago
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban… ☆26 Jul 15, 2025 Updated 7 months ago
- A Multi-graph Multi-head Adaptive Temporal Graph Convolutional Network ☆11 May 21, 2023 Updated 2 years ago
- ☆17 Apr 17, 2025 Updated 10 months ago
- ☆11 Feb 5, 2024 Updated 2 years ago
- Official implementation of the ICCV 2025 paper HoliTracer. ☆40 Jan 13, 2026 Updated last month
- ☆13 Jun 13, 2024 Updated last year
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment ☆36 Oct 5, 2025 Updated 4 months ago
- ☆111 Sep 15, 2025 Updated 5 months ago
- ☆14 Mar 31, 2021 Updated 4 years ago
- [ICLR 2026] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models ☆79 Jan 21, 2026 Updated 3 weeks ago
- ☆43 Jan 17, 2024 Updated 2 years ago
- Empower Traffic Simulation via Foundation Model ☆24 Oct 9, 2023 Updated 2 years ago
- Knowledge Graph Large Language Model (KG-LLM) ☆36 Jun 23, 2024 Updated last year
- Official code for VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator ☆108 Feb 9, 2026 Updated last week
- [ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model ☆82 Jul 24, 2025 Updated 6 months ago
- ☆23 Apr 19, 2024 Updated last year
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025) ☆62 Nov 8, 2025 Updated 3 months ago
- This is the official repo of OpenSatMap in NeurIPS 2024 D&B Track ☆29 Jul 6, 2025 Updated 7 months ago
- Satellite-Ground Fusion for 3D Semantic Scene Completion ☆28 Sep 8, 2025 Updated 5 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆46 Jul 17, 2025 Updated 7 months ago
- ☆69 Oct 16, 2025 Updated 4 months ago
- [ACM MM 25] Official repo of "UEMM-Air: Enable UAVs to Undertake More Multi-modal Tasks" ☆33 Aug 20, 2025 Updated 5 months ago
- ☆38 Jul 14, 2025 Updated 7 months ago
- ☆29 Apr 23, 2025 Updated 9 months ago
- [ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis ☆110 Nov 3, 2025 Updated 3 months ago
- Official implementation of the paper "Geographic mapping with unsupervised multi-modal representation learning from VHR images and POIs" ☆27 Sep 27, 2023 Updated 2 years ago
- Awesome paper list and repos of the paper "A comprehensive survey of embodied world models". ☆68 Oct 22, 2025 Updated 3 months ago
- Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25). ☆66 Dec 26, 2025 Updated last month
- [NAACL 2025 Main] AgentMove: A Large Language Model based Agentic Framework for Zero-shot Next Location Prediction. ☆43 Jul 26, 2025 Updated 6 months ago
- [IJCV 2024] MoDA: Modeling Deformable 3D Objects from Casual Videos ☆33 Jan 14, 2025 Updated last year
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm” ☆82 Oct 17, 2025 Updated 4 months ago
- Spatio-temporal modeling paper list (mainly related to graph convolution) ☆24 Nov 7, 2019 Updated 6 years ago
- An Awesome Collection of Urban Foundation Models (UFMs). ☆208 Jan 11, 2026 Updated last month
- ☆26 Aug 6, 2025 Updated 6 months ago
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in… ☆36 Apr 10, 2025 Updated 10 months ago
- [ICLR 2025 Oral] NeuralPlane: Structured 3D Reconstruction in Planar Primitives with Neural Fields ☆57 Jan 3, 2026 Updated last month