huggingface / cosmopedia
☆515 Updated 5 months ago
Alternatives and similar repositories for cosmopedia:
Users who are interested in cosmopedia are comparing it to the libraries listed below.
- Generative Representational Instruction Tuning ☆624 Updated last month
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆459 Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆254 Updated 9 months ago
- RewardBench: the first evaluation tool for reward models. ☆562 Updated 2 months ago
- Official repository for ORPO ☆450 Updated 11 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆693 Updated last month
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆354 Updated 7 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆238 Updated 5 months ago
- A bagel, with everything. ☆320 Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆301 Updated last year
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).