Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
☆335May 1, 2026Updated this week
Alternatives and similar repositories for agent-world-model
Users that are interested in agent-world-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PAHF Personalized Agent from Human Feedback☆47Apr 26, 2026Updated last week
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆29Feb 17, 2025Updated last year
- ATS for NeurIPS 2021☆24Nov 4, 2021Updated 4 years ago
- ☆108Apr 24, 2026Updated 2 weeks ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆29Sep 23, 2025Updated 7 months ago
- ☆47Jun 24, 2025Updated 10 months ago
- ☆14Feb 5, 2014Updated 12 years ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- ☆15Feb 13, 2025Updated last year
- HACMan++ code release. RSS 2024.☆22Dec 23, 2024Updated last year
- ☆34Dec 9, 2023Updated 2 years ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Feb 18, 2022Updated 4 years ago
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning☆702Apr 11, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Spatial Aptitude Training for Multimodal Langauge Models☆31Feb 8, 2026Updated 2 months ago
- HSML Dynamic version for ICML 2019☆12Jul 11, 2019Updated 6 years ago
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 9 months ago
- CORAL is a robust, lightweight infrastructure for multi-agent autonomous self-evolution, built for autoresearch.☆608Updated this week
- Competitive Programming Code Template☆11Nov 6, 2022Updated 3 years ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated 11 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆32Feb 14, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆42Mar 26, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆32Jan 25, 2026Updated 3 months ago
- Tool that gathers a customizable set of ETW telemetry and generates user-defined detections☆53Jan 28, 2026Updated 3 months ago
- Interactive LLM Chatbot that constructs direct and transitive software dependencies as a knowledge graph and answers user's questions lev…☆33Mar 11, 2026Updated last month
- Git worktree sandboxes, locally or on remote hosts over SSH.☆35Feb 9, 2026Updated 2 months ago
- View and manage Claude Code tasks and memory in a floating Hammerspoon window with live updates.☆31Apr 1, 2026Updated last month
- PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback" (https://arxiv.org/abs/2107.14035).☆16Sep 9, 2022Updated 3 years ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆266May 5, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Storing long contexts in tiny caches with self-study☆264Mar 23, 2026Updated last month
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆160Apr 6, 2025Updated last year
- [ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue☆26Oct 18, 2025Updated 6 months ago
- A holistic framework for advancing LLMs as data science agents☆40Feb 3, 2026Updated 3 months ago
- ☆101Feb 11, 2026Updated 2 months ago
- [NeurIPS'25] ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions☆39Dec 7, 2025Updated 5 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 8 months ago