Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
☆360May 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for agent-world-model
Users that are interested in agent-world-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PAHF Personalized Agent from Human Feedback☆50Apr 26, 2026Updated last month
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆29Feb 17, 2025Updated last year
- ☆113Apr 24, 2026Updated last month
- SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning☆769May 17, 2026Updated last week
- ☆30Sep 23, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A fast, lightweight text-to-speech tool that runs entirely on your CPU. Give it text, pick a voice, and get a WAV file out.☆65Feb 22, 2026Updated 3 months ago
- ☆47Jun 24, 2025Updated 11 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- HACMan++ code release. RSS 2024.☆22Dec 23, 2024Updated last year
- ☆34Dec 9, 2023Updated 2 years ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Feb 18, 2022Updated 4 years ago
- Official Implementation of "TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery"☆16Mar 18, 2024Updated 2 years ago
- Spatial Aptitude Training for Multimodal Langauge Models☆32Feb 8, 2026Updated 3 months ago
- HSML Dynamic version for ICML 2019☆12Jul 11, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 10 months ago
- Competitive Programming Code Template☆10Nov 6, 2022Updated 3 years ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated 11 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆32Feb 14, 2026Updated 3 months ago
- ☆42Mar 26, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆35May 15, 2026Updated last week
- Claude Code and Large-Context Reasoning (O'Reilly Live Learning)☆209Mar 19, 2026Updated 2 months ago
- PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback" (https://arxiv.org/abs/2107.14035).☆16Sep 9, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆268May 5, 2025Updated last year
- ☆23Oct 22, 2025Updated 7 months ago
- [ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue☆26Oct 18, 2025Updated 7 months ago
- ☆50Nov 9, 2025Updated 6 months ago
- ☆18Jul 1, 2023Updated 2 years ago
- [NeurIPS'25] ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions☆41Dec 7, 2025Updated 5 months ago
- A standard language for machine-readable code comments☆131Mar 17, 2026Updated 2 months ago
- ☆33Aug 21, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Running Mixture of Agents on CPU: LFM2.5 Brain (1.2B) + Falcon-R Reasoner (600M) + Tool Caller (90M). CPU-only, 16GB RAM. Lightweight AI …☆34Feb 7, 2026Updated 3 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 9 months ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated 11 months ago
- Sometime after this repo was made sourcemaps were turned off.☆23Feb 10, 2026Updated 3 months ago
- ☆55Jul 4, 2025Updated 10 months ago
- ☆18Jun 3, 2024Updated last year
- ☆14May 6, 2026Updated 3 weeks ago