gao-g / prelude
Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".
☆34Updated last month
Alternatives and similar repositories for prelude:
Users that are interested in prelude are comparing it to the libraries listed below
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆53Updated 10 months ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆36Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆109Updated 2 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆35Updated 2 weeks ago
- ☆26Updated 6 months ago
- ☆20Updated 7 months ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆20Updated 6 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆98Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆110Updated 7 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆66Updated 2 weeks ago
- ☆25Updated 8 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆51Updated 9 months ago
- Critique-out-Loud Reward Models☆47Updated 3 months ago
- CodeUltraFeedback: aligning large language models to coding preferences☆66Updated 6 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆111Updated last year
- ☆93Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆63Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆111Updated 4 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆107Updated last month
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆40Updated 3 weeks ago
- ☆36Updated 5 months ago
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆29Updated last year
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆134Updated last month
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 4 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆57Updated 6 months ago
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆59Updated 5 months ago
- Evaluate the Quality of Critique☆35Updated 7 months ago