OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
โ245Updated 6 months ago
Alternatives and similar repositories for gsm8k-ScRel:
Users that are interested in gsm8k-ScRel are comparing it to the libraries listed below
- โ142Updated 2 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningโ243Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจโ184Updated 10 months ago
- โ326Updated last month
- โ261Updated 7 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMsโ241Updated 2 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.โ293Updated 7 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodingsโ153Updated 9 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other moโฆโ342Updated 6 months ago
- A large-scale, fine-grained, diverse preference dataset (and models).โ331Updated last year
- Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasetsโ313Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Modelsโ177Updated 5 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witโฆโ115Updated 8 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.โ129Updated last year
- Repo of paper "Free Process Rewards without Process Labels"โ132Updated this week
- ๐ An unofficial implementation of Self-Alignment with Instruction Backtranslation.โ137Updated 8 months ago
- โ59Updated 3 months ago
- โ65Updated 11 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningโ144Updated 6 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuningโ416Updated 4 months ago
- The official repository of the Omni-MATH benchmark.โ74Updated 2 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]โ536Updated 3 months ago
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmarkโ371Updated 8 months ago
- The related works and background techniques about Openai o1โ216Updated 2 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsโ158Updated 8 months ago
- [ICLR 2025] ๐งฌ RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)โ111Updated 3 weeks ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scieโฆโ132Updated 7 months ago