j0hngou / LLMCWMLinks
This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models"
☆25Updated 6 months ago
Alternatives and similar repositories for LLMCWM
Users that are interested in LLMCWM are comparing it to the libraries listed below
Sorting:
- Natural Language Reinforcement Learning☆99Updated 3 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆93Updated last month
- ☆33Updated last year
- ☆105Updated this week
- ☆35Updated last year
- ☆116Updated 9 months ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆52Updated 5 months ago
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆23Updated 8 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆39Updated last month
- ☆12Updated 8 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆121Updated 7 months ago
- ☆63Updated 8 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆112Updated last year
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆97Updated 10 months ago
- ☆40Updated last year
- Graph Diffusion Policy Optimization☆41Updated last year
- ☆69Updated this week
- Reinforced Multi-LLM Agents training☆56Updated 5 months ago
- Reasoning with Language Model is Planning with World Model☆175Updated 2 years ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆46Updated 5 months ago
- ☆20Updated 3 months ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆65Updated last year
- Reinforcing General Reasoning without Verifiers☆91Updated 4 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆91Updated last year
- exploring whether LLMs perform case-based or rule-based reasoning☆30Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆109Updated 3 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆67Updated 7 months ago
- ☆144Updated last year
- ☆106Updated last year
- ☆50Updated 5 months ago