j0hngou / LLMCWM
This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models"
☆18Updated 2 months ago
Alternatives and similar repositories for LLMCWM
Users who are interested in LLMCWM are comparing it to the repositories listed below
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆11Updated 4 months ago
- Repo for Anonymous purpose, pls don't distribute☆10Updated 9 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆85Updated 6 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 3 weeks ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆78Updated 3 months ago
- Natural Language Reinforcement Learning☆92Updated 7 months ago
- ☆18Updated 6 months ago
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆27Updated 7 months ago
- ☆29Updated last week
- ☆32Updated 3 months ago
- exploring whether LLMs perform case-based or rule-based reasoning☆29Updated last year
- ☆16Updated 2 months ago
- Reinforcing General Reasoning without Verifiers☆71Updated 3 weeks ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆60Updated 5 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆10Updated 5 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆67Updated 2 months ago
- ☆318Updated last month
- ☆61Updated 4 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]☆35Updated last week
- Adapt MLLMs to Domains via Post-Training☆9Updated 6 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆78Updated last month
- ☆19Updated last month
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆18Updated 8 months ago
- Triple Preference Optimization☆26Updated 5 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆103Updated 2 weeks ago
- ☆210Updated 4 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆45Updated 2 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆130Updated this week
- ☆11Updated 4 months ago