[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches
☆62 Mar 4, 2025 Updated 11 months ago
Alternatives and similar repositories for PreSelect
Users who are interested in PreSelect are comparing it to the repositories listed below.
- An automated data pipeline scaling RL to pretraining levels ☆72 Oct 11, 2025 Updated 4 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆147 Sep 20, 2024 Updated last year
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi… ☆33 Feb 7, 2025 Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆14 Apr 30, 2025 Updated 10 months ago
- Ant Financial natural language processing competition. ☆10 Sep 3, 2018 Updated 7 years ago
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24) ☆11 Jun 16, 2024 Updated last year
- The repository contains code for Adaptive Data Optimization ☆32 Dec 9, 2024 Updated last year
- Provides a minimal implementation to extract FLAN datasets for further processing ☆11 Feb 1, 2023 Updated 3 years ago
- ☆30 Nov 5, 2024 Updated last year
- [NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey ☆36 May 18, 2025 Updated 9 months ago
- ☆15 Mar 12, 2024 Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆42 Dec 29, 2025 Updated 2 months ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023) ☆15 Nov 18, 2023 Updated 2 years ago
- ☆14 Apr 18, 2020 Updated 5 years ago
- [ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha… ☆15 May 18, 2024 Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 May 26, 2025 Updated 9 months ago
- ☆43 Oct 13, 2023 Updated 2 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX ☆14 Oct 24, 2024 Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning ☆35 Oct 26, 2025 Updated 4 months ago
- LLM training in simple, raw C/CUDA ☆15 Dec 5, 2024 Updated last year
- Repo for the introductory course on reinforcement learning. ☆15 Feb 2, 2025 Updated last year
- ☆41 Sep 21, 2023 Updated 2 years ago
- The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers" ☆19 Jul 24, 2024 Updated last year
- Transform your CapsLock into an AI key! This AutoHotkey app puts powerful AI capabilities right at your fingertips, supercharging your Wi… ☆21 Oct 31, 2025 Updated 4 months ago
- This repository contains the code for the 4th place solution to RecSys Challenge 2020. ☆18 Sep 26, 2020 Updated 5 years ago
- ☆21 Aug 19, 2024 Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆86 May 21, 2025 Updated 9 months ago
- Beyond KV Caching: Shared Attention for Efficient LLMs ☆20 Jul 19, 2024 Updated last year
- Simple GRPO scripts and configurations. ☆59 Feb 6, 2025 Updated last year
- ☆53 May 20, 2024 Updated last year
- Data distillation benchmark ☆72 Jun 13, 2025 Updated 8 months ago
- ☆17 Jun 23, 2022 Updated 3 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs. ☆41 Oct 31, 2025 Updated 4 months ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆97 May 25, 2023 Updated 2 years ago
- Visualization of mean field and neural tangent kernel regime ☆23 Jul 25, 2024 Updated last year
- The official implementation of the paper "Does Federated Learning Really Need Backpropagation?" ☆23 Feb 9, 2023 Updated 3 years ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ… ☆40 Jul 18, 2025 Updated 7 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆123 May 6, 2025 Updated 9 months ago
- ☆39 Aug 26, 2025 Updated 6 months ago