HqWu-HITCS / Awesome-Personalized-LLM
This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.
☆121Updated 7 months ago
Alternatives and similar repositories for Awesome-Personalized-LLM
Users that are interested in Awesome-Personalized-LLM are comparing it to the libraries listed below
Sorting:
- Reformatted Alignment☆114Updated 7 months ago
- Awesome papers for role-playing with language models☆186Updated 6 months ago
- ☆88Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆116Updated 7 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆80Updated last year
- Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models☆106Updated last week
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆236Updated last month
- A Comprehensive Survey on Long Context Language Modeling☆142Updated last month
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆119Updated 6 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆56Updated last year
- ☆196Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆146Updated 3 weeks ago
- Awesome Agent Training☆113Updated last week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆123Updated 10 months ago
- ☆102Updated 5 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆48Updated this week
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 10 months ago
- ☆49Updated last year
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆202Updated this week
- Code implementation of synthetic continued pretraining☆109Updated 4 months ago
- ☆97Updated 2 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆126Updated 6 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆72Updated 3 weeks ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆151Updated 8 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆93Updated 3 months ago
- ☆153Updated 3 weeks ago
- ☆151Updated 2 weeks ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆96Updated 3 weeks ago
- The demo, code and data of FollowRAG☆72Updated 3 weeks ago