maitrix-org / SimWorld
Main repo for SimWorld simulator.
☆61 · Updated last week
Alternatives and similar repositories for SimWorld
Users who are interested in SimWorld are comparing it to the repositories listed below.
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models … ☆68 · Updated 2 months ago
- Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning" ☆78 · Updated last month
- ☆78 · Updated 11 months ago
- ☆85 · Updated last month
- A paper list that includes world models or generative video models for embodied agents. ☆24 · Updated 7 months ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration ☆50 · Updated 3 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆39 · Updated 8 months ago
- ☆78 · Updated 3 months ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling ☆11 · Updated 3 months ago
- Evaluate Multimodal LLMs as Embodied Agents ☆53 · Updated 6 months ago
- OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding ☆59 · Updated last month
- ☆108 · Updated last month
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions". ☆142 · Updated 2 months ago
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence ☆47 · Updated 3 weeks ago
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" ☆51 · Updated last week
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions". ☆25 · Updated 2 months ago
- Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos ☆135 · Updated 2 weeks ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆114 · Updated 3 weeks ago
- Virtual Community: An Open World for Humans, Robots, and Society ☆168 · Updated last week
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction ☆35 · Updated 8 months ago
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs ☆46 · Updated last year
- ☆43 · Updated last year
- ☆29 · Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar… ☆67 · Updated 11 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆72 · Updated 8 months ago
- [CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation" ☆60 · Updated 2 weeks ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models ☆147 · Updated 3 months ago
- The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025) ☆72 · Updated 2 weeks ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World ☆130 · Updated 10 months ago
- Official implementation for our paper "Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance" (CoRL 2024) ☆32 · Updated 4 months ago