Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
☆254Feb 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for agent-world-model
Users that are interested in agent-world-model are comparing it to the libraries listed below
Sorting:
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- ☆46Jun 24, 2025Updated 8 months ago
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning☆77Updated this week
- A holistic framework for advancing LLMs as data science agents☆33Feb 3, 2026Updated last month
- Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code☆36Feb 27, 2026Updated last week
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 9 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- ☆21Oct 22, 2025Updated 4 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- PAHF Personalized Agent from Human Feedback☆37Feb 25, 2026Updated last week
- ☆28Sep 23, 2025Updated 5 months ago
- ☆14Feb 13, 2025Updated last year
- Spatial Aptitude Training for Multimodal Langauge Models☆24Feb 8, 2026Updated 3 weeks ago
- Official implementation of "Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning"☆16Jan 22, 2025Updated last year
- Implementation for MomentumSMoE☆19Apr 19, 2025Updated 10 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration☆41Jan 7, 2026Updated last month
- This is the official implementation of the paper “Griffin: Towards a Graph-Centric Relational Database Foundation Model.”☆34Sep 25, 2025Updated 5 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆81Nov 16, 2025Updated 3 months ago
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.☆109Feb 11, 2026Updated 3 weeks ago
- ☆93Dec 30, 2025Updated 2 months ago
- HACMan++ code release. RSS 2024.☆22Dec 23, 2024Updated last year
- Hydragen: High-Throughput LLM Inference with Shared Prefixes☆48May 10, 2024Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated 11 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- a survey on deep research☆47Sep 9, 2025Updated 5 months ago
- ☆49Jul 22, 2024Updated last year
- ☆26Jun 22, 2024Updated last year
- ☆17Aug 1, 2025Updated 7 months ago
- ☆18Sep 5, 2024Updated last year
- ☆115May 7, 2025Updated 9 months ago
- Vocabulary Parallelism☆25Mar 10, 2025Updated 11 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Apr 2, 2024Updated last year
- ☆44Feb 27, 2026Updated last week
- My personal web page☆11Feb 17, 2026Updated 2 weeks ago
- ☆27Jan 22, 2025Updated last year
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆83Jan 16, 2026Updated last month
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated 8 months ago