Qihoo360 / Light-IFLinks
☆40Updated 2 months ago
Alternatives and similar repositories for Light-IF
Users that are interested in Light-IF are comparing it to the libraries listed below
Sorting:
- ☆147Updated last year
- 在verl上做reward的定制开发☆144Updated 8 months ago
- a-m-team's exploration in large language modeling☆194Updated 8 months ago
- ☆48Updated last year
- The related works and background techniques about Openai o1☆220Updated last year
- ☆432Updated 3 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆155Updated last year
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 …☆37Updated 8 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆141Updated 7 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆90Updated last year
- 怎么训练一个LLM分词器☆153Updated 2 years ago
- Fantastic Data Engineering for Large Language Models☆93Updated last year
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework☆208Updated 3 weeks ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆48Updated last year
- ☆147Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆101Updated 11 months ago
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆47Updated last year
- ☆164Updated last year
- ☆42Updated 11 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆284Updated 2 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆416Updated 7 months ago
- ☆97Updated 2 weeks ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆59Updated 2 years ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆117Updated 2 years ago
- ☆54Updated last year
- ☆98Updated last year
- ☆142Updated 8 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆188Updated 7 months ago
- ☆165Updated 3 months ago
- ☆84Updated 2 years ago