maitrix-org / SimWorld

[NeurIPS 2025 Spotlight] SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

☆65

Alternatives and similar repositories for SimWorld

Users who are interested in SimWorld are comparing it to the repositories listed below.

USC-GVL / PhysBench
[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …
☆72 Updated 4 months ago
UMass-Embodied-AGI / MindJourney
Source code for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"
☆86 Updated 2 months ago
video-to-action / video-to-action-release
[ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration
☆55 Updated 5 months ago
rainbow979 / robodreamer
☆80 Updated last year
QinengWang-Aiden / Awesome-embodied-world-model-papers
A paper list that includes world models or generative video models for embodied agents.
☆25 Updated 8 months ago
YunzeMan / Situation3D
[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning
☆41 Updated 10 months ago
mll-lab-nu / MindCube
☆90 Updated last week
Singularity0104 / equilibrium-planner
[ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling
☆11 Updated 5 months ago
video-language-planning / vlp_code
☆76 Updated 4 months ago
OpenHelix-Team / VLA-RFT
VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning
☆45 Updated this week
InternRobotics / OST-Bench
[NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
☆59 Updated last week
MSR3D / MSR3D
[NeurIPS 2024] Official code repository for MSR3D paper
☆64 Updated 2 months ago
Little-Podi / AdaWorld
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
☆161 Updated 3 months ago
JeffWang987 / EgoVid
[NeurIPS 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
☆118 Updated 2 months ago
Kami-code / HandsOnVLM-release
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
☆36 Updated 3 weeks ago
joyhsu0504 / LEFT
☆45 Updated last year
thunlp / EmbodiedEval
Evaluate Multimodal LLMs as Embodied Agents
☆54 Updated 7 months ago
HeegerGao / FLIP
Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
☆73 Updated 9 months ago
sg-3d / sg3d
☆51 Updated last year
bytedance / IRASim
☆116 Updated 3 months ago
ykarmesh / stable-control-representations
Code for Stable Control Representations
☆25 Updated 6 months ago
google-deepmind / robovqa
☆30 Updated last year
InternRobotics / MMSI-Bench
[arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
☆53 Updated 2 months ago
cvlab-columbia / videopolicy
☆27 Updated 2 months ago
declare-lab / Emma-X
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
☆74 Updated 4 months ago
xiaoxiao0406 / VQ-VLA
The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆81 Updated last month
OpenGVLab / VeBrain
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
☆81 Updated 4 months ago
sled-group / 3D-GRAND
[CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
☆46 Updated last year
robomonkey-vla / RoboMonkey
☆16 Updated 2 months ago
cvlab-columbia / dreamitate
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)
☆52 Updated 4 months ago