flowersteam / WorldLLMLinks
LLM as World Models using Bayesian inference
☆16Updated 8 months ago
Alternatives and similar repositories for WorldLLM
Users that are interested in WorldLLM are comparing it to the libraries listed below
Sorting:
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆35Updated 3 weeks ago
- The original Shared Recurrent Memory Transformer implementation☆33Updated 6 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆66Updated 11 months ago
- Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"☆67Updated this week
- Verlog: A Multi-turn RL framework for LLM agents☆67Updated 3 weeks ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated last month
- ☆100Updated last week
- Official repo of paper LM2☆46Updated 11 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆92Updated 8 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Updated last week
- General multi-task deep RL Agent☆185Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆73Updated last year
- ☆29Updated 3 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆37Updated 4 months ago
- ☆11Updated last year
- ☆87Updated 2 years ago
- Repository for the paper Stream of Search: Learning to Search in Language☆153Updated last year
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces☆10Updated 10 months ago
- ☆123Updated last week
- ☆35Updated 8 months ago
- ☆25Updated 8 months ago
- ☆32Updated last year
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆88Updated this week
- Universal Reasoning Model☆122Updated 3 weeks ago
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆26Updated 11 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆175Updated 4 months ago
- ☆42Updated last year
- Benchmarking Agentic LLM and VLM Reasoning On Games☆228Updated 2 months ago
- ☆30Updated last year
- ☆56Updated last year