KbsdJames / Awesome-LLM-Preference-Learning
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
☆160Updated 3 months ago
Alternatives and similar repositories for Awesome-LLM-Preference-Learning:
Users that are interested in Awesome-LLM-Preference-Learning are comparing it to the libraries listed below
- [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation☆150Updated 3 weeks ago
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆311Updated last week
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆155Updated 2 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆143Updated 10 months ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆158Updated 3 months ago
- Controllable Text Generation for Large Language Models: A Survey☆157Updated 5 months ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆189Updated 5 months ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆186Updated 4 months ago
- Fantastic Data Engineering for Large Language Models☆71Updated last month
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆58Updated 4 months ago
- The official code repository for PRMBench.☆64Updated last week
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.☆266Updated 2 months ago
- notes for Multi-hop Reading Comprehension and open-domain question answering☆85Updated 2 years ago
- [ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools☆102Updated 3 weeks ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks!☆223Updated 2 weeks ago
- ☆203Updated 2 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 2 months ago
- This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use".☆201Updated this week
- The framework to prune LLMs to any size and any config.☆87Updated 11 months ago
- MoH: Multi-Head Attention as Mixture-of-Head Attention☆202Updated 3 months ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆149Updated 2 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆107Updated 7 months ago
- StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding☆73Updated this week
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆93Updated last year
- Toolkit for Prompt Compression☆243Updated last week
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆38Updated 6 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆154Updated last month
- The official repository of the Omni-MATH benchmark.☆71Updated 2 months ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models".☆40Updated 3 months ago