HumainLab / Personalized_RLHF
☆26Updated last year
Alternatives and similar repositories for Personalized_RLHF
Users who are interested in Personalized_RLHF are comparing it to the libraries listed below
- ☆184Updated last year
- Official Implementation of "Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning" at EMNLP 2024 Main Conf…☆43Updated 6 months ago
- Codes for papers on Large Language Models Personalization (LaMP)☆185Updated 11 months ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆79Updated 7 months ago
- ☆33Updated last year
- ☆46Updated last year
- This is the repo for the survey of Bias and Fairness in IR with LLMs.☆59Updated 5 months ago
- ☆103Updated last year
- ☆24Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Updated 6 months ago
- Conformal Language Modeling☆32Updated 2 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆85Updated 10 months ago
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆35Updated last year
- ☆241Updated last year
- ☆18Updated last year
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆144Updated last year
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆60Updated last year
- [NeurIPS 2024] GITA: Graph to Image-Text Integration for Vision-Language Graph Reasoning☆53Updated 2 months ago
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting☆24Updated last year
- Code for paper: Are Large Language Models Post Hoc Explainers?☆34Updated last year
- ☆35Updated 7 months ago
- ☆39Updated last year
- AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection☆19Updated 2 years ago
- Official code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆75Updated last year
- ☆42Updated 2 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆79Updated last year
- Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization☆21Updated last year
- ☆58Updated 2 years ago
- This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering."☆41Updated 2 years ago
- Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation☆21Updated last year