thu-coai / BPOLinks
☆322Updated 11 months ago
Alternatives and similar repositories for BPO
Users that are interested in BPO are comparing it to the libraries listed below
Sorting:
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆372Updated 9 months ago
- ☆142Updated 11 months ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆166Updated last year
- ☆162Updated 2 years ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆392Updated 10 months ago
- ☆244Updated 3 weeks ago
- Generative Judge for Evaluating Alignment☆239Updated last year
- ☆222Updated last year
- 用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.☆252Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆261Updated last year