UMass-Embodied-AGI / Virtual-Community
Virtual Community: An Open World for Humans, Robots, and Society
☆181Updated last month
Alternatives and similar repositories for Virtual-Community
Users who are interested in Virtual-Community compare it to the repositories listed below.
- [NeurIPS 2025] Source code for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆125Updated 2 months ago
- ☆165Updated 3 weeks ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆199Updated 3 months ago
- SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds☆322Updated last week
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆201Updated 8 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆194Updated 7 months ago
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models☆124Updated 3 weeks ago
- ☆223Updated 3 months ago
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning☆121Updated 3 months ago
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations.☆43Updated 2 weeks ago
- Official code for EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models☆96Updated 7 months ago
- ☆118Updated 2 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆219Updated 3 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆79Updated last year
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"☆53Updated last week
- Official implementation of "Self-Improving Video Generation"☆78Updated 9 months ago
- [ICCV 2025] RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆103Updated 4 months ago
- Ctrl-World: A Controllable Generative World Model for Robot Manipulation☆259Updated last month
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆133Updated last year
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆167Updated 3 months ago
- Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation☆102Updated 6 months ago
- Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆183Updated 3 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆83Updated last week
- ☆91Updated last year
- VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos☆273Updated last week
- ☆162Updated last year
- ☆139Updated 6 months ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆59Updated 8 months ago
- InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆91Updated 4 months ago
- ☆78Updated 8 months ago