KbsdJames / Awesome-LLM-Preference-Learning
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
☆174Updated 6 months ago
Alternatives and similar repositories for Awesome-LLM-Preference-Learning:
Users that are interested in Awesome-LLM-Preference-Learning are comparing it to the libraries listed below
- [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation☆167Updated 2 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆167Updated 5 months ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆347Updated 2 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆148Updated last year
- Controllable Text Generation for Large Language Models: A Survey☆171Updated 8 months ago
- ☆155Updated 2 weeks ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆190Updated 7 months ago
- Recipes to train the self-rewarding reasoning LLMs.☆214Updated 2 months ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆215Updated 6 months ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆164Updated 5 months ago
- Fantastic Data Engineering for Large Language Models☆87Updated 4 months ago
- [ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools☆107Updated 3 months ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks!☆246Updated 3 months ago
- [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering☆143Updated 2 months ago
- The official code repository for PRMBench.☆72Updated 2 months ago
- ☆55Updated 6 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding☆119Updated last month
- A recipe for online RLHF and online iterative DPO.☆508Updated 4 months ago
- Benchmarking LLMs via Uncertainty Quantification☆225Updated last year
- [ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models☆70Updated last month
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆60Updated 6 months ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆49Updated 6 months ago
- ☆95Updated last month
- Codebase for Iterative DPO Using Rule-based Rewards☆243Updated 3 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆119Updated last month
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆90Updated 3 weeks ago
- The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆152Updated last month
- ☆203Updated 5 months ago
- A Comprehensive Survey on Long Context Language Modeling☆138Updated last month