KbsdJames / Awesome-LLM-Preference-LearningLinks
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
☆177Updated 7 months ago
Alternatives and similar repositories for Awesome-LLM-Preference-Learning
Users that are interested in Awesome-LLM-Preference-Learning are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation☆171Updated 3 months ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆351Updated 3 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆168Updated 5 months ago
- Controllable Text Generation for Large Language Models: A Survey☆175Updated 9 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆147Updated last year
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆167Updated 6 months ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆232Updated last week
- ☆196Updated 3 weeks ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆190Updated 8 months ago
- Recipes to train the self-rewarding reasoning LLMs.☆219Updated 3 months ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆222Updated 7 months ago
- Codebase for Iterative DPO Using Rule-based Rewards☆245Updated last month
- [ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools☆110Updated 3 weeks ago
- [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering☆143Updated 3 months ago
- [ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models☆72Updated 2 months ago
- The official code repository for PRMBench.☆73Updated 3 months ago
- Benchmarking LLMs via Uncertainty Quantification☆230Updated last year
- Official code for AAAI2023 paper`Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`☆17Updated 3 months ago
- Grimoire is All You Need for Enhancing Large Language Models☆115Updated last year
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆154Updated 6 months ago
- ☆131Updated 2 weeks ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks!☆255Updated this week
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆93Updated last year
- [ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models☆142Updated 6 months ago
- A Survey on the Honesty of Large Language Models☆57Updated 5 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆149Updated 2 months ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆60Updated 7 months ago
- A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024 (Distinguished Pape…☆113Updated 6 months ago
- The awesome agents in the era of large language models☆64Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 6 months ago