j0hngou / LLMCWM
This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models"
☆28Updated 9 months ago
Alternatives and similar repositories for LLMCWM
Users that are interested in LLMCWM are comparing it to the libraries listed below
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based causal reasoning works, including papers, code, and datasets.☆115Updated 4 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆115Updated 2 years ago
- Natural Language Reinforcement Learning☆101Updated 6 months ago
- ☆32Updated last year
- ☆44Updated last year
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆125Updated 10 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆70Updated 10 months ago
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆23Updated 11 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆96Updated last year
- ☆12Updated 11 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47Updated 9 months ago
- ☆41Updated last year
- Reasoning with Language Model is Planning with World Model☆185Updated 2 years ago
- Official Implementation of "DeLLMa: Decision Making Under Uncertainty with Large Language Models"☆70Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 10 months ago
- ☆117Updated last year
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆74Updated 9 months ago
- ☆77Updated 3 months ago
- ☆67Updated 11 months ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆56Updated 2 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- ☆203Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆120Updated last week
- [ACL 2024] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models☆24Updated last year
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 5 months ago
- Graph Diffusion Policy Optimization☆42Updated last year
- ☆205Updated last month
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Updated 4 months ago
- Lightweight Adapting for Black-Box Large Language Models☆25Updated last year
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆43Updated last year