gao-g / prelude

Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".

☆41

Alternatives and similar repositories for prelude

Users who are interested in prelude are comparing it to the libraries listed below.

OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54 Updated last year
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆123 Updated 10 months ago
likenneth / dialogue_action_token
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
☆25 Updated last year
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆101 Updated last month
icip-cas / Verifier-Engineering
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
☆61 Updated 7 months ago
Yifan-Song793 / ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆146 Updated 8 months ago
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆36 Updated last year
liziniu / GEM
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
☆33 Updated 2 months ago
ucl-dark / llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
☆112 Updated last year
zankner / CLoud
Critique-out-Loud Reward Models
☆68 Updated 9 months ago
eric11eca / disco
Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…
☆37 Updated last year
jwhj / OREO
☆114 Updated 5 months ago
rxlqn / awesome-llm-self-reflection
augmented LLM with self reflection
☆129 Updated last year
NingMiao / SelfCheck
Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>
☆48 Updated last year
deeplearning-wisc / args
☆43 Updated last year
causalNLP / corr2cause
Data and code for the Corr2Cause paper (ICLR 2024)
☆107 Updated last year
YuxiXie / SelfEval-Guided-Decoding
☆99 Updated last year
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences
☆71 Updated last year
XiangLi1999 / AutoBencher
☆29 Updated last year
hkust-nlp / B-STaR
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆82 Updated last month
princeton-nlp / LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
☆127 Updated last year
joeljang / RLPHF
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
☆108 Updated last year
Agent-E3 / ExACT
☆19 Updated 4 months ago
xingyaoww / mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…
☆126 Updated last year
QingruZhang / PASTA
PASTA: Post-hoc Attention Steering for LLMs
☆121 Updated 7 months ago
WindyLee0822 / Process_Q_Model
official implementation of paper "Process Reward Model with Q-value Rankings"
☆60 Updated 5 months ago
haozheji / exact-optimization
ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment
☆58 Updated last year
waterhorse1 / Natural-language-RL
Natural Language Reinforcement Learning
☆92 Updated 6 months ago
scandukuri / assistant-gate
☆26 Updated last year
cambridgeltl / PairS
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)
☆47 Updated 5 months ago