Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
☆109May 11, 2026Updated last week
Alternatives and similar repositories for EASI
Users that are interested in EASI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆56Dec 28, 2025Updated 4 months ago
- ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…☆49Nov 27, 2025Updated 5 months ago
- The official repository of the first version of ACE-Brain foundation model.☆76Mar 13, 2026Updated 2 months ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆23Mar 5, 2026Updated 2 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆78Feb 27, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Mar 18, 2026Updated 2 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated 3 weeks ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- ☆20Oct 12, 2025Updated 7 months ago
- ☆25Apr 6, 2026Updated last month
- Vero: An Open RL Recipe for General Visual Reasoning☆121Apr 19, 2026Updated last month
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆68Mar 22, 2026Updated 2 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated last month
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30Mar 18, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [MICCAI 2024] HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training☆22Nov 17, 2024Updated last year
- [ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation☆39Dec 16, 2025Updated 5 months ago
- Internal utility libraries for Pkl☆16May 14, 2026Updated last week
- WorldSense benchmark for grounded reasoning in language models☆24Nov 28, 2023Updated 2 years ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Mar 12, 2026Updated 2 months ago
- [MedIA 2026] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation☆31Feb 16, 2026Updated 3 months ago
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training☆182May 5, 2026Updated 2 weeks ago
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant☆427Mar 19, 2025Updated last year
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…☆30Jul 15, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆41Jul 5, 2025Updated 10 months ago
- A local AI assistant running on your device. It turns your files into actionable memory.☆55Mar 24, 2026Updated last month
- [ICLR 2025] Official code of the paper "Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement"☆30Updated this week
- Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights☆32Jan 9, 2026Updated 4 months ago
- Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"☆55Jul 5, 2025Updated 10 months ago
- ☆22Dec 3, 2025Updated 5 months ago
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆20Nov 11, 2025Updated 6 months ago
- ☆37Apr 21, 2026Updated last month
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆69Jan 23, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Toy O☆16Sep 21, 2024Updated last year
- ☆10Jul 6, 2021Updated 4 years ago
- ☆52Jul 6, 2025Updated 10 months ago
- A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models☆113Updated this week
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆47Mar 18, 2026Updated 2 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆84Mar 9, 2026Updated 2 months ago
- PyTorch Implementation for InMaP☆12Oct 28, 2023Updated 2 years ago