The repository contains code for Adaptive Data Optimization
☆36 · Dec 9, 2024 · Updated last year
Alternatives and similar repositories for ado
Users who are interested in ado are comparing it to the libraries listed below.
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library. ☆14 · Jan 9, 2024 · Updated 2 years ago
- Official Code Repository for [AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*… ☆14 · Aug 8, 2025 · Updated 10 months ago
- A simple and efficient baseline for data attribution ☆11 · Nov 10, 2023 · Updated 2 years ago
- ACL24 ☆11 · Jun 7, 2024 · Updated 2 years ago
- ☆11 · Jul 21, 2024 · Updated last year
- Official Repository for Dataset Inference for LLMs ☆41 · Jul 25, 2024 · Updated last year
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu… ☆88 · Dec 12, 2025 · Updated 6 months ago
- Forcing Diffuse Distributions out of Language Models ☆18 · Sep 10, 2024 · Updated last year
- ☆12 · Oct 20, 2023 · Updated 2 years ago
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion ☆11 · Apr 1, 2024 · Updated 2 years ago
- ☆18 · Oct 12, 2022 · Updated 3 years ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024] ☆21 · May 2, 2024 · Updated 2 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles". ☆24 · Mar 25, 2025 · Updated last year
- ☆11 · Oct 20, 2023 · Updated 2 years ago
- Training vision models with full-batch gradient descent and regularization ☆40 · Feb 14, 2023 · Updated 3 years ago
- ☆13 · Dec 12, 2025 · Updated 6 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models" ☆25 · Dec 12, 2023 · Updated 2 years ago
- ☆30 · Jun 19, 2023 · Updated 2 years ago
- Aioli: A unified optimization framework for language model data mixing ☆32 · Jan 17, 2025 · Updated last year
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025) ☆35 · Sep 28, 2025 · Updated 8 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆23 · Aug 18, 2024 · Updated last year
- PAL: Proxy-Guided Black-Box Attack on Large Language Models ☆56 · Aug 17, 2024 · Updated last year
- ☆15 · Oct 4, 2024 · Updated last year
- [ICML 2026] Esoteric Language Models ☆118 · May 1, 2026 · Updated last month
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches ☆64 · Mar 4, 2025 · Updated last year
- Pytorch ImageNet1k Loader with Bounding Boxes. ☆13 · Jan 23, 2022 · Updated 4 years ago
- Source code of "What can linearized neural networks actually say about generalization?" ☆20 · Oct 21, 2021 · Updated 4 years ago
- ☆21 · Apr 3, 2026 · Updated 2 months ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆96 · May 25, 2023 · Updated 3 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG). ☆24 · Apr 26, 2025 · Updated last year
- Levin tree search guided by both a policy and a heuristic function ☆19 · Jul 13, 2023 · Updated 2 years ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning ☆14 · Jun 3, 2025 · Updated last year
- Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models. ☆113 · Nov 22, 2023 · Updated 2 years ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆205 · Jul 17, 2024 · Updated last year
- Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning ☆17 · May 14, 2023 · Updated 3 years ago
- ☆10 · Jul 13, 2024 · Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆32 · Jun 5, 2025 · Updated last year
- Face Recognition on NVIDIA TX2 ☆10 · Sep 5, 2018 · Updated 7 years ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆98 · Nov 17, 2024 · Updated last year