Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
☆377May 28, 2026Updated 3 weeks ago
Alternatives and similar repositories for agent-world-model
Users that are interested in agent-world-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Yet another dynamic batch sampler for variable sequence data in PyTorch.☆13Dec 9, 2021Updated 4 years ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆29Feb 17, 2025Updated last year
- ATS for NeurIPS 2021☆24Nov 4, 2021Updated 4 years ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 11 months ago
- PAHF Personalized Agent from Human Feedback☆54Apr 26, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning☆842May 17, 2026Updated last month
- ☆30Sep 23, 2025Updated 8 months ago
- A fast, lightweight text-to-speech tool that runs entirely on your CPU. Give it text, pick a voice, and get a WAV file out.☆65Feb 22, 2026Updated 3 months ago
- ☆47Jun 24, 2025Updated 11 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- ☆16Feb 13, 2025Updated last year
- HACMan++ code release. RSS 2024.☆22Dec 23, 2024Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Feb 18, 2022Updated 4 years ago
- Spatial Aptitude Training for Multimodal Langauge Models☆33Feb 8, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 11 months ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated last year
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- CORAL is a robust, lightweight infrastructure for multi-agent autonomous self-evolution, built for autoresearch. Works with Claude Code, …☆728Updated this week
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆32Feb 14, 2026Updated 4 months ago
- ☆121May 7, 2025Updated last year
- ☆42Mar 26, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆36May 15, 2026Updated last month
- Interactive LLM Chatbot that constructs direct and transitive software dependencies as a knowledge graph and answers user's questions lev…☆33Jun 8, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆56May 5, 2026Updated last month
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆268May 5, 2025Updated last year
- Code and dataset for SIGIR 2017 short paper "Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Ans…☆10Aug 1, 2017Updated 8 years ago
- Storing long contexts in tiny caches with self-study☆273Mar 23, 2026Updated 2 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆161Apr 6, 2025Updated last year
- ☆23Oct 22, 2025Updated 7 months ago
- ☆50Nov 9, 2025Updated 7 months ago
- ☆18Jul 1, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆32Aug 21, 2025Updated 9 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 9 months ago
- Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets☆17Feb 18, 2025Updated last year
- Java Image Processing Pipeline (JIPipe) is a graphical batch processing language for the ImageJ ecosystem☆17May 27, 2025Updated last year
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated last year
- ☆60Jul 4, 2025Updated 11 months ago
- [NeurIPS'25] ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions☆47Dec 7, 2025Updated 6 months ago