flowersteam / WorldLLMLinks
LLM as World Models using Bayesian inference
☆16Updated 7 months ago
Alternatives and similar repositories for WorldLLM
Users that are interested in WorldLLM are comparing it to the libraries listed below
Sorting:
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆33Updated last month
- The original Shared Recurrent Memory Transformer implementation☆33Updated 6 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 10 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆89Updated 7 months ago
- ☆29Updated 2 months ago
- Verlog: A Multi-turn RL framework for LLM agents☆67Updated this week
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆72Updated last year
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆27Updated 11 months ago
- ☆30Updated last year
- ☆99Updated last week
- Benchmarking Agentic LLM and VLM Reasoning On Games☆225Updated last month
- Universal Reasoning Model☆119Updated this week
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces☆10Updated 9 months ago
- Official repo of paper LM2☆46Updated 11 months ago
- ☆11Updated last year
- ☆29Updated 10 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated last month
- this is for fun, ain't it grand!☆21Updated 4 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆36Updated 3 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆66Updated 11 months ago
- ☆66Updated 10 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆173Updated 4 months ago
- ☆45Updated 6 months ago
- ☆86Updated 2 years ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆88Updated 10 months ago
- ☆35Updated 8 months ago
- ☆27Updated 11 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 4 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆73Updated 2 years ago
- General multi-task deep RL Agent☆185Updated last year