XueyangFeng / ReHAC
Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"
☆32Updated 7 months ago
Alternatives and similar repositories for ReHAC:
Users that are interested in ReHAC are comparing it to the libraries listed below
- A Comprehensive Library for Memory of LLM-based Agents.☆15Updated 2 months ago
- ☆26Updated 2 months ago
- ☆58Updated 4 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 6 months ago
- ☆26Updated 2 weeks ago
- ☆30Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆43Updated 6 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆48Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆138Updated 6 months ago
- Code and data for "The Power of Noise: Redefining Retrieval for RAG Systems"☆53Updated 6 months ago
- ☆41Updated last year
- This repo is reproduction resources for linear alignment paper, still working☆17Updated 11 months ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆16Updated 3 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆114Updated 7 months ago
- ☆45Updated 6 months ago
- ☆21Updated 6 months ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆35Updated 8 months ago
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models"☆24Updated last year
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆36Updated last month
- ☆17Updated 8 months ago
- ☆86Updated last year
- A research repo for experiments about Reinforcement Finetuning☆46Updated last month
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆16Updated 4 months ago
- Enhancing contextual understanding in large language models through contrastive decoding☆17Updated last year
- ☆23Updated last year
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆31Updated 6 months ago
- ☆18Updated 7 months ago