keirp / OpenWebMath
☆110Updated 4 months ago
Related projects: ⓘ
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆195Updated 3 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly"☆121Updated 3 months ago
- Official implementation of DPFM @ ICLR 2024 paper "Autonomous Data Selection with Language Models for Mathematical Texts" (Huggingface Da…☆73Updated this week
- Official code for "MAmmoTH2: Scaling Instructions from the Web"☆106Updated this week
- Self-Alignment with Principle-Following Reward Models☆144Updated 6 months ago
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆244Updated last week
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆104Updated 2 months ago
- Reformatted Alignment☆111Updated 4 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆148Updated 6 months ago
- ☆79Updated 3 months ago
- DSIR large-scale data selection framework for language model training☆221Updated 5 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆182Updated last month
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆65Updated last month
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆87Updated 2 months ago
- ☆179Updated this week
- ☆99Updated last year
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆208Updated last week
- ☆87Updated 4 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆101Updated last week
- An Experiment on Dynamic NTK Scaling RoPE☆59Updated 9 months ago
- ☆74Updated this week
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆133Updated 3 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆134Updated 6 months ago
- Unofficial implementation of AlpaGasus☆83Updated 11 months ago
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆153Updated 4 months ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆77Updated 2 months ago
- A Survey on Data Selection for Language Models☆148Updated 3 months ago
- Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese…☆118Updated last year
- Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆115Updated 2 weeks ago
- Repository for analysis and experiments in the BigCode project.☆113Updated 5 months ago