HazyResearch/aioli

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HazyResearch/aioli)

HazyResearch / aioli

Aioli: A unified optimization framework for language model data mixing

☆33

Alternatives and similar repositories for aioli

Users that are interested in aioli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CodeCreator / WebOrganizer
View on GitHub
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
☆83May 2, 2025Updated last year
feiyang-k / AutoScale
View on GitHub
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆14Aug 8, 2025Updated 11 months ago
HazyResearch / skill-it
View on GitHub
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆48Oct 31, 2023Updated 2 years ago
john-hewitt / implicit-ins
View on GitHub
Codebase for Instruction Following without Instruction Tuning
☆36Sep 24, 2024Updated last year
microsoft / encoder-decoder-slm
View on GitHub
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…
☆32Feb 7, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
john-hewitt / truncation-sampling
View on GitHub
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
☆13Dec 6, 2022Updated 3 years ago
cxcscmu / MATES
View on GitHub
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
☆80Nov 14, 2024Updated last year
EleutherAI / semantic-memorization
View on GitHub
☆44Nov 17, 2024Updated last year
Jiachen-T-Wang / GREATS
View on GitHub
☆20Jun 27, 2026Updated 3 weeks ago
renll / SparseLT
View on GitHub
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14Feb 10, 2023Updated 3 years ago
waefrebeorn / KAN-WuBu-Memory
View on GitHub
An AI character interaction system with emotional modeling and advanced memory management
☆17Oct 26, 2024Updated last year
ruizheliUOA / ARC_JSD
View on GitHub
A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation
☆15Aug 28, 2025Updated 10 months ago
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago
sail-sg / Rigging-ChatbotArena
View on GitHub
Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)
☆27Feb 25, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
October2001 / ProLong
View on GitHub
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆61Jul 23, 2024Updated 2 years ago
princeton-nlp / QuRating
View on GitHub
[ICML 2024] Selecting High-Quality Data for Training Language Models
☆204Dec 8, 2025Updated 7 months ago
tmlr-group / CoPA
View on GitHub
[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"
☆11Nov 15, 2024Updated last year
wong-justin / clark
View on GitHub
Trim and timestamp audio, in the terminal
☆14Oct 14, 2024Updated last year
PRIME-RL / RL-Compositionality
View on GitHub
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆68Jan 26, 2026Updated 5 months ago
MadryLab / datamodels-data
View on GitHub
Data for "Datamodels: Predicting Predictions with Training Data"
☆97May 25, 2023Updated 3 years ago
tmlr-group / BayesianLM
View on GitHub
[NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"
☆12Dec 20, 2024Updated last year
microsoft / GRTr
View on GitHub
Generative Retrieval Transformer
☆30Jul 23, 2023Updated 3 years ago
PAIR-code / pretraining-tda
View on GitHub
☆33Feb 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
martenlienen / bsi
View on GitHub
Generative Modeling with Bayesian Sample Inference
☆24May 17, 2025Updated last year
tmlr-group / SCT
View on GitHub
[NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"
☆13Oct 28, 2024Updated last year
HazyResearch / domino
View on GitHub
☆143Oct 30, 2023Updated 2 years ago
OpenHands / agent-analysis
View on GitHub
A collection of scripts and tools for analyzing SWE agents.
☆16May 7, 2025Updated last year
kyegomez / MLXTransformer
View on GitHub
Simple Implementation of a Transformer in the new framework MLX by Apple
☆19Nov 18, 2024Updated last year
hkust-nlp / PreSelect
View on GitHub
[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches
☆66Mar 4, 2025Updated last year
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33Jan 23, 2025Updated last year
Aloriosa / srmt
View on GitHub
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated last year
shauli-ravfogel / adv-kernel-removal
View on GitHub
☆12Oct 23, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
LFhase / HIGHT
View on GitHub
[ICML 2025] Hierarchical Graph Tokenization for Molecule-Language Alignment
☆16Aug 18, 2025Updated 11 months ago
Sphere-AI-Lab / fda
View on GitHub
Implementation of <Model Merging with Functional Dual Anchors>
☆46Nov 23, 2025Updated 8 months ago
tmlr-group / NoisyRationales
View on GitHub
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆40Jul 18, 2025Updated last year
JJchy / CG_score
View on GitHub
Data Valuation without Training of a Model, submitted to ICLR'23
☆22Dec 30, 2022Updated 3 years ago
Nanami18 / Snowballed_Hallucination
View on GitHub
☆43Sep 3, 2024Updated last year
TheDuckAI / prm
View on GitHub
☆12Jan 17, 2025Updated last year
MLforHealth / MIMIC_Generalisation
View on GitHub
Code to study the generalisability of benchmark models on non-stationary EHRs.
☆15Aug 7, 2019Updated 6 years ago