CodeCreator / datatools
Common tools for data processing
☆12Updated last month
Alternatives and similar repositories for datatools
Users that are interested in datatools are comparing it to the libraries listed below
Sorting:
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆179Updated 2 months ago
- ☆44Updated 9 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆75Updated 5 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆77Updated this week
- ☆59Updated 8 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆101Updated 2 months ago
- LightThinker: Thinking Step-by-Step Compression☆44Updated last month
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 6 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated last week
- Code for Zero-Shot Tokenizer Transfer☆127Updated 3 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 11 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆21Updated 8 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆61Updated 10 months ago
- The HELMET Benchmark☆143Updated 3 weeks ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 5 months ago
- Revisiting Mid-training in the Era of RL Scaling☆37Updated 2 weeks ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- ☆24Updated 3 weeks ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆110Updated 3 weeks ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆42Updated 7 months ago
- ☆22Updated 3 months ago
- ☆151Updated 4 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆23Updated 2 weeks ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆30Updated 10 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Long Context Extension and Generalization in LLMs☆54Updated 7 months ago
- ☆51Updated 6 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆93Updated last week
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Updated last year