Aioli: A unified optimization framework for language model data mixing
β32Jan 17, 2025Updated last year
Alternatives and similar repositories for aioli
Users that are interested in aioli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β14Aug 8, 2025Updated 9 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Modelsβ48Oct 31, 2023Updated 2 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21)β13Oct 25, 2022Updated 3 years ago
- β53Jan 24, 2024Updated 2 years ago
- Codebase for Instruction Following without Instruction Tuningβ36Sep 24, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothingβ13Dec 6, 2022Updated 3 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curationβ81May 2, 2025Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]β79Nov 14, 2024Updated last year
- β16Jul 29, 2025Updated 9 months ago
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"β11Nov 15, 2024Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typingβ14Feb 10, 2023Updated 3 years ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)β27Feb 25, 2025Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function callβ20Aug 19, 2024Updated last year
- β16Nov 30, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"β12Dec 20, 2024Updated last year
- β33Feb 11, 2025Updated last year
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policiesβ30Aug 14, 2024Updated last year
- Generative Retrieval Transformerβ30Jul 23, 2023Updated 2 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- The repository contains code for Adaptive Data Optimizationβ36Dec 9, 2024Updated last year
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".β18Apr 25, 2025Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"β40Jul 18, 2025Updated 10 months ago
- This repository contains all of the code used in the blog post, A guide to Machine Learning on iPhone : Intro to Apple's CoreMLβ20Sep 25, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Exploration of automated dataset selection approaches at large scales.β54Mar 4, 2025Updated last year
- β33Jun 24, 2024Updated last year
- My personal research notebook with notes, tutorials, and resources written in Jupyterbook.β21May 14, 2026Updated last week
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β116Sep 26, 2024Updated last year
- GOPHI: an AMR-to-English Verbalizerβ11Feb 5, 2020Updated 6 years ago
- Data Valuation without Training of a Model, submitted to ICLR'23β22Dec 30, 2022Updated 3 years ago
- θθιθθͺηΆθ―θ¨ε€ηη«θ΅γβ10Sep 3, 2018Updated 7 years ago
- Code for blog post on r-squaredβ13Jul 25, 2016Updated 9 years ago
- Inverse Scaling in Test-Time Computeβ25Dec 3, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Time-ordered UUIDv4β20Jun 10, 2024Updated last year
- Emacs minor mode for entering unicode math symbolsβ11Dec 10, 2023Updated 2 years ago
- β44Nov 17, 2024Updated last year
- β15Nov 22, 2023Updated 2 years ago
- β24Dec 8, 2024Updated last year
- β12Jun 18, 2024Updated last year
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Leβ¦β14Jan 16, 2025Updated last year