liyongqi2002 / Awesome-Personalized-Alignment
A curated list of personalized alignment resources (continually updated).
☆16 Updated this week
Alternatives and similar repositories for Awesome-Personalized-Alignment:
Users who are interested in Awesome-Personalized-Alignment are comparing it to the repositories listed below.
- ☆113 Updated 7 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language … ☆34 Updated 3 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆112 Updated 7 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs). ☆40 Updated last year
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆190 Updated last week
- ☆26 Updated this week
- ☆71 Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆115 Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆72 Updated this week
- ☆93 Updated last week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆52 Updated 4 months ago
- ☆42 Updated 5 months ago
- ☆132 Updated 9 months ago
- The official code repository for PRMBench. ☆72 Updated 2 months ago
- The code and data of DPA-RAG ☆58 Updated 3 months ago
- ☆17 Updated 7 months ago
- ☆34 Updated last month
- ☆11 Updated last month
- ☆55 Updated 6 months ago
- ☆72 Updated 10 months ago
- A Survey on the Honesty of Large Language Models ☆57 Updated 4 months ago
- A comprehensive collection of process reward models. ☆67 Updated this week
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process! ☆49 Updated 3 weeks ago
- [ACL'2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs" ☆12 Updated 7 months ago
- ☆44 Updated 5 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning" ☆107 Updated this week
- ☆26 Updated 2 months ago
- This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly) ☆46 Updated 2 months ago
- ☆73 Updated 11 months ago
- ☆90 Updated 3 months ago