xqlin98 / APOHF
Prompt Optimization with Human Feedback
☆17Updated last year
Alternatives and similar repositories for APOHF
Users who are interested in APOHF are comparing it to the libraries listed below
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based causal reasoning works, including papers, codes and datasets.☆115Updated 4 months ago
- Implementation of the MATRIX framework (ICML 2024)☆60Updated last year
- Reinforced Multi-LLM Agents training☆70Updated 3 weeks ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆56Updated 2 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆144Updated last year
- ☆77Updated 3 months ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Updated 2 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆56Updated 3 months ago
- ☆51Updated 9 months ago
- This is the code of MMOA-RAG.☆102Updated 9 months ago
- ☆54Updated 6 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆125Updated 10 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆143Updated 11 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆70Updated 10 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆72Updated 11 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 5 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆30Updated last year
- [ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AI☆31Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆114Updated last week
- ☆117Updated last year
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆43Updated last year
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration☆187Updated 2 weeks ago
- ☆53Updated 11 months ago
- ☆56Updated 11 months ago
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆217Updated 3 months ago
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Updated 3 months ago
- ☆46Updated last year
- [ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"☆34Updated 2 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆18Updated last year