Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
☆313Mar 16, 2026Updated last month
Alternatives and similar repositories for agent-world-model
Users who are interested in agent-world-model are comparing it to the libraries listed below.
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆29Feb 17, 2025Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 9 months ago
- CORAL is a robust, lightweight infrastructure for multi-agent autonomous self-evolution, built for autoresearch.☆447Updated this week
- ☆29Sep 23, 2025Updated 6 months ago
- ☆46Jun 24, 2025Updated 9 months ago
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning☆608Updated this week
- ☆15Feb 13, 2025Updated last year
- HACMan++ code release. RSS 2024.☆22Dec 23, 2024Updated last year
- ☆34Dec 9, 2023Updated 2 years ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 9 months ago
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- Competitive Programming Code Template☆11Nov 6, 2022Updated 3 years ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated 10 months ago
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆29Feb 14, 2026Updated 2 months ago
- ☆115May 7, 2025Updated 11 months ago
- [ACL 2026] Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆49Apr 6, 2026Updated last week
- ☆42Mar 26, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated 2 months ago
- Interactive LLM Chatbot that constructs direct and transitive software dependencies as a knowledge graph and answers user's questions lev…☆33Mar 11, 2026Updated last month
- PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback" (https://arxiv.org/abs/2107.14035).☆16Sep 9, 2022Updated 3 years ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆33Oct 30, 2025Updated 5 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆51Feb 12, 2026Updated 2 months ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- Method for Long Context RLMs using verifiable Lambda Calculus☆136Apr 1, 2026Updated 2 weeks ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆266May 5, 2025Updated 11 months ago
- ☆21Oct 22, 2025Updated 5 months ago
- [ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue☆26Oct 18, 2025Updated 5 months ago
- A holistic framework for advancing LLMs as data science agents☆40Feb 3, 2026Updated 2 months ago
- ☆100Feb 11, 2026Updated 2 months ago
- ☆50Nov 9, 2025Updated 5 months ago
- [NeurIPS'25] ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions☆34Dec 7, 2025Updated 4 months ago
- ☆18Jul 1, 2023Updated 2 years ago
- ☆31Aug 21, 2025Updated 7 months ago
- Python SDK for CMDOP agent interaction☆41Apr 7, 2026Updated last week
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 7 months ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated 10 months ago
- ☆52Jul 4, 2025Updated 9 months ago
- VLS: Steering Pretrained Robot Policies via Vision–Language Models☆48Mar 29, 2026Updated 2 weeks ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 7 months ago