KbsdJames / Awesome-LLM-Preference-Learning
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
☆145Updated last week
Related projects ⓘ
Alternatives and complementary repositories for Awesome-LLM-Preference-Learning
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆257Updated this week
- xFinder: Robust and Pinpoint Answer Extraction for Large Language Models☆143Updated last week
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆157Updated 2 weeks ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆182Updated last month
- Controllable Text Generation for Large Language Models: A Survey☆140Updated 2 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆125Updated 7 months ago
- Benchmarking LLMs via Uncertainty Quantification☆219Updated 9 months ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆137Updated 3 weeks ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆179Updated 2 months ago
- Grimoire is All You Need for Enhancing Large Language Models☆116Updated 8 months ago
- [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering☆126Updated 2 weeks ago
- MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆124Updated 3 weeks ago
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆117Updated last year
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆65Updated last month
- Multi-Agent-GPT: 一款基于RAG和agent构建的多模态专家助手GPT。它集成了文本、图像和音频等模态工具。支持本地部署和私有数据库建设。☆221Updated 8 months ago
- Fantastic Data Engineering for Large Language Models☆49Updated 3 months ago
- MoH: Multi-Head Attention as Mixture-of-Head Attention☆141Updated last week
- A recipe for online RLHF and online iterative DPO.☆413Updated this week
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆18Updated last week
- [ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?☆148Updated last month
- ☆114Updated last year
- notes for Multi-hop Reading Comprehension and open-domain question answering☆87Updated 2 years ago
- Connect agents to live web environments evaluation.☆196Updated last month
- Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering☆101Updated 4 months ago
- [EMNLP 2023] FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models☆85Updated 10 months ago
- Prompt Learning using Metaheuristics☆135Updated 8 months ago
- ☆98Updated 5 months ago
- ☆190Updated this week
- A Comprehensive Benchmark for Code Information Retrieval.☆63Updated 2 weeks ago
- Toolkit for Prompt Compression☆244Updated 3 weeks ago