Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
☆88Feb 25, 2026Updated 3 weeks ago
Alternatives and similar repositories for EASI
Users interested in EASI are comparing it to the libraries listed below
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆53Dec 28, 2025Updated 2 months ago
- The official repository of the first version of ACE-Brain foundation model.☆62Mar 13, 2026Updated last week
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆21Mar 5, 2026Updated 2 weeks ago
- ☆13Updated this week
- Rui Zhu's implementation of the CVPR 2020 work Inverse Rendering for Complex Indoor Scene by Li et al.☆13Jan 17, 2023Updated 3 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 7 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆36Jan 16, 2026Updated 2 months ago
- ☆16Oct 12, 2025Updated 5 months ago
- ☆24Dec 24, 2025Updated 2 months ago
- Unified Codebase for Advanced World Models.☆136Updated this week
- [CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆518Mar 1, 2026Updated 3 weeks ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆23Mar 2, 2026Updated 2 weeks ago
- ☆37Dec 16, 2025Updated 3 months ago
- Internal utility libraries for Pkl☆16Mar 10, 2026Updated last week
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated last week
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training☆143Mar 13, 2026Updated last week
- ☆30Jan 15, 2026Updated 2 months ago
- WorldSense benchmark for grounded reasoning in language models☆24Nov 28, 2023Updated 2 years ago
- ☆61Oct 25, 2025Updated 4 months ago
- A local AI assistant running on your device. It turns your files into actionable memory.☆54Mar 14, 2026Updated last week
- ☆32Mar 13, 2026Updated last week
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…☆26Jul 15, 2025Updated 8 months ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆40Jul 5, 2025Updated 8 months ago
- Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights☆32Jan 9, 2026Updated 2 months ago
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆132Feb 21, 2026Updated last month
- Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"☆53Jul 5, 2025Updated 8 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆45Mar 2, 2026Updated 2 weeks ago
- ☆21Dec 3, 2025Updated 3 months ago
- d3LLM: Ultra-Fast Diffusion LLM 🚀☆110Mar 15, 2026Updated last week
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆63Jan 23, 2026Updated last month
- DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data☆46Dec 12, 2025Updated 3 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆180Feb 24, 2026Updated 3 weeks ago
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆17Nov 11, 2025Updated 4 months ago
- The official implementation of the paper The Change You Want to See (Now in 3D) (ICCVW 2023).☆29Jul 5, 2024Updated last year
- ☆10Jul 6, 2021Updated 4 years ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆24Oct 19, 2025Updated 5 months ago
- ☆48Jul 6, 2025Updated 8 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆44Updated this week
- PyTorch Implementation for InMaP☆11Oct 28, 2023Updated 2 years ago