HumainLab / Personalized_RLHF
☆9 Updated 4 months ago
Alternatives and similar repositories for Personalized_RLHF:
Users who are interested in Personalized_RLHF are comparing it to the repositories listed below:
- ☆18 Updated last year
- Official Implementation of "Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning" at EMNLP 2024 Main Conf… ☆28 Updated 3 months ago
- Code for paper: Are Large Language Models Post Hoc Explainers? ☆31 Updated 9 months ago
- ☆25 Updated last year
- LaTeX Drawing ☆11 Updated 2 years ago
- Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement" ☆23 Updated 2 years ago
- SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! ☆9 Updated 2 months ago
- Official code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le… ☆73 Updated last year
- Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation ☆20 Updated last year
- ☆164 Updated 10 months ago
- ☆87 Updated 9 months ago
- AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection ☆17 Updated last year
- Data and code for the Corr2Cause paper (ICLR 2024) ☆96 Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆72 Updated last month
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆35 Updated 3 months ago
- ☆30 Updated 11 months ago
- Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization ☆12 Updated 4 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆53 Updated 5 months ago
- A curated list of resources for graph prompting methods ☆29 Updated last year
- PyTorch Implementation of Prompt-augmented Temporal Point Process for Streaming Event Sequence, NeurIPS 2023 ☆14 Updated last year
- A python package providing a benchmark with various specified distribution shift patterns. ☆57 Updated last year
- ☆37 Updated 7 months ago
- ☆26 Updated this week
- ☆40 Updated last year
- This is the repo for the survey of Bias and Fairness in IR with LLMs. ☆52 Updated 3 weeks ago
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective ☆29 Updated 2 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models ☆18 Updated 7 months ago
- ☆33 Updated 2 weeks ago
- Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs ☆20 Updated 6 months ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022) ☆66 Updated 2 years ago