Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
☆87 · Updated this week
Alternatives and similar repositories for EASI
Users who are interested in EASI are comparing it to the repositories listed below
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models". ☆52 · Dec 28, 2025 · Updated 2 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023) ☆15 · Aug 6, 2025 · Updated 6 months ago
- ☆13 · Feb 20, 2026 · Updated last week
- Self Evolving Large Multimodal Models with Continuous Rewards ☆19 · Nov 21, 2025 · Updated 3 months ago
- ☆16 · Oct 12, 2025 · Updated 4 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆34 · Jan 16, 2026 · Updated last month
- The training code for Jasper-Token-Compression-600M ☆19 · Nov 19, 2025 · Updated 3 months ago
- A Curated List of Vision-Language-Action (VLA) Research ☆61 · Updated this week
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui… ☆36 · Updated this week
- ☆22 · Dec 24, 2025 · Updated 2 months ago
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban… ☆26 · Jul 15, 2025 · Updated 7 months ago
- ☆36 · Dec 16, 2025 · Updated 2 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs. ☆54 · Feb 11, 2026 · Updated 2 weeks ago
- ☆54 · Nov 12, 2025 · Updated 3 months ago
- ☆56 · Oct 25, 2025 · Updated 4 months ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation ☆38 · Jul 5, 2025 · Updated 7 months ago
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant ☆399 · Mar 19, 2025 · Updated 11 months ago
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn… ☆44 · Jan 5, 2026 · Updated last month
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations. ☆52 · Jan 12, 2026 · Updated last month
- Stable-DiffCoder is a family of lightweight open-source code DLLMs (diffusion large language models) comprising base and instruct models, … ☆72 · Jan 23, 2026 · Updated last month
- EO: Open-source Unified Embodied Foundation Model Series ☆51 · Jan 15, 2026 · Updated last month
- [KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for kernel generation ☆90 · Feb 6, 2026 · Updated 3 weeks ago
- Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing" ☆53 · Jul 5, 2025 · Updated 7 months ago
- InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models ☆84 · Feb 2, 2026 · Updated 3 weeks ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas… ☆10 · Dec 19, 2024 · Updated last year
- Official implementation of the NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation" ☆44 · Jan 26, 2024 · Updated 2 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co… ☆13 · Jan 16, 2026 · Updated last month
- Official Repository of Native Parallel Reasoner ☆100 · Feb 5, 2026 · Updated 3 weeks ago
- Visual Spatial Tuning ☆176 · Feb 19, 2026 · Updated last week
- Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025 ☆23 · Nov 13, 2025 · Updated 3 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory. ☆29 · Updated this week
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration" ☆261 · Dec 31, 2025 · Updated 2 months ago
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos ☆115 · Dec 12, 2025 · Updated 2 months ago
- Official repo of Toucan: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments ☆227 · Dec 16, 2025 · Updated 2 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models ☆43 · Mar 11, 2025 · Updated 11 months ago
- ☆10 · Jul 6, 2021 · Updated 4 years ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 · Dec 30, 2025 · Updated 2 months ago
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control! ☆122 · Feb 21, 2026 · Updated last week
- Benchmark evaluating ocean forecasting systems against reference datasets and observations. ☆26 · Updated this week