j0hngou / LLMCWMLinks
This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models"
☆28Updated 9 months ago
Alternatives and similar repositories for LLMCWM
Users that are interested in LLMCWM are comparing it to the libraries listed below
Sorting:
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆115Updated 4 months ago
- Natural Language Reinforcement Learning☆101Updated 6 months ago
- ☆32Updated last year
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Updated 4 months ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆56Updated 2 months ago
- ☆143Updated 2 months ago
- Graph Diffusion Policy Optimization☆42Updated last year
- Reinforcing General Reasoning without Verifiers☆96Updated 7 months ago
- ☆117Updated last year
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47Updated 9 months ago
- Open-source Agentic RL for LLMs — RLAnything & DemyAgent☆223Updated last week
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆59Updated 5 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆175Updated 4 months ago
- ☆41Updated last year
- ☆12Updated 11 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆96Updated last year
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆70Updated 10 months ago
- ☆67Updated 11 months ago
- [ICML 2024 Oral] A framework for society simulation that supports complex simulation, for example: multi-scene.☆84Updated last year
- Reinforced Multi-LLM Agents training☆70Updated 3 weeks ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- ☆76Updated 3 months ago
- ☆44Updated last year
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆74Updated 9 months ago
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆23Updated 11 months ago
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)☆194Updated 6 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆125Updated 10 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆115Updated 2 years ago
- ☆352Updated 6 months ago
- ☆229Updated 11 months ago