Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
★107 · Apr 30, 2026 · Updated this week
Alternatives and similar repositories for EASI
Users that are interested in EASI are comparing it to the libraries listed below.
- The official repository of the first version of ACE-Brain foundation model. ★75 · Mar 13, 2026 · Updated last month
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards ★23 · Mar 5, 2026 · Updated last month
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui… ★73 · Feb 27, 2026 · Updated 2 months ago
- ★13 · Mar 18, 2026 · Updated last month
- Rui Zhu's implementation of the CVPR 2020 work Inverse Rendering for Complex Indoor Scene by Li et al. ★13 · Jan 17, 2023 · Updated 3 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023) ★15 · Apr 8, 2026 · Updated 3 weeks ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ★37 · Apr 25, 2026 · Updated last week
- The training codes of Jasper-Token-Compression-600M ★19 · Nov 19, 2025 · Updated 5 months ago
- ★63 · Nov 12, 2025 · Updated 5 months ago
- ★20 · Oct 12, 2025 · Updated 6 months ago
- Vero: An Open RL Recipe for General Visual Reasoning ★114 · Apr 19, 2026 · Updated last week
- ★25 · Apr 6, 2026 · Updated 3 weeks ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs ★68 · Mar 22, 2026 · Updated last month
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex… ★26 · Apr 4, 2026 · Updated 3 weeks ago
- [CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations ★546 · Apr 22, 2026 · Updated last week
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models ★31 · Mar 18, 2026 · Updated last month
- WorldSense benchmark for grounded reasoning in language models ★24 · Nov 28, 2023 · Updated 2 years ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs. ★56 · Mar 12, 2026 · Updated last month
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant ★419 · Mar 19, 2025 · Updated last year
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban… ★28 · Jul 15, 2025 · Updated 9 months ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation ★41 · Jul 5, 2025 · Updated 9 months ago
- ★16 · Aug 21, 2019 · Updated 6 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ★36 · Oct 3, 2025 · Updated 6 months ago
- A local AI assistant running on your device. It turns your files into actionable memory. ★55 · Mar 24, 2026 · Updated last month
- Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights ★32 · Jan 9, 2026 · Updated 3 months ago
- In our implementation of Qwen-Image-Edit, we employ block causal attention to improve inference speed. ★50 · Feb 16, 2026 · Updated 2 months ago
- Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing" ★54 · Jul 5, 2025 · Updated 9 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning ★52 · Mar 2, 2026 · Updated 2 months ago
- ★21 · Dec 3, 2025 · Updated 4 months ago
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"… ★20 · Nov 11, 2025 · Updated 5 months ago
- ★37 · Apr 21, 2026 · Updated last week
- Fudan University graduate student course-grabbing script ★10 · Feb 14, 2022 · Updated 4 years ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images ★69 · Jan 23, 2026 · Updated 3 months ago
- ★35 · Updated this week
- ★10 · Jul 6, 2021 · Updated 4 years ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs (diffusion large language models) comprising base and instruct models, … ★84 · Mar 9, 2026 · Updated last month
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ★183 · Feb 24, 2026 · Updated 2 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning ★45 · Mar 18, 2026 · Updated last month
- d3LLM: Ultra-Fast Diffusion LLM ★120 · Apr 25, 2026 · Updated last week