bowen-upenn / Agent_Rationality
[NAACL 2025] Towards Rationality in Language and Multimodal Agents: A Survey
☆35Updated 10 months ago
Alternatives and similar repositories for Agent_Rationality
Users who are interested in Agent_Rationality are comparing it to the libraries listed below
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆137Updated last year
- [ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AI☆31Updated last year
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆85Updated last week
- ☆146Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆147Updated last year
- ☆42Updated last year
- augmented LLM with self reflection☆135Updated 2 years ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆63Updated 11 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆132Updated last year
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆97Updated 11 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆31Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated 2 years ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆112Updated 4 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆111Updated last year
- Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".☆43Updated last year
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment"☆69Updated 2 years ago
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆113Updated 9 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆125Updated last year
- ☆135Updated 9 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆50Updated last year
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"☆38Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 9 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆148Updated 2 years ago
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆43Updated last year
- Code for "Reasoning to Learn from Latent Thoughts"☆123Updated 8 months ago
- Natural Language Reinforcement Learning☆100Updated 4 months ago
- ☆103Updated last year
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆131Updated 8 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆109Updated 6 months ago
- ☆53Updated 10 months ago