Aioli: A unified optimization framework for language model data mixing
β32Jan 17, 2025Updated last year
Alternatives and similar repositories for aioli
Users that are interested in aioli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β14Aug 8, 2025Updated 10 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Modelsβ48Oct 31, 2023Updated 2 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21)β13Oct 25, 2022Updated 3 years ago
- β18Mar 23, 2025Updated last year
- Codebase for Instruction Following without Instruction Tuningβ36Sep 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothingβ13Dec 6, 2022Updated 3 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curationβ81May 2, 2025Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]β79Nov 14, 2024Updated last year
- β16Jul 29, 2025Updated 10 months ago
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"β11Nov 15, 2024Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typingβ14Feb 10, 2023Updated 3 years ago
- β10Jul 7, 2025Updated 11 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Feb 22, 2024Updated 2 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ60Jul 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)β27Feb 25, 2025Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function callβ20Aug 19, 2024Updated last year
- β16Nov 30, 2022Updated 3 years ago
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policiesβ30Aug 14, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β33Jan 23, 2025Updated last year
- The repository contains code for Adaptive Data Optimizationβ36Dec 9, 2024Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Appleβ19Nov 18, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementationβ36Jul 11, 2025Updated 11 months ago
- Generative Modeling with Bayesian Sample Inferenceβ24May 17, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Exploration of automated dataset selection approaches at large scales.β55Mar 4, 2025Updated last year
- β33Jun 24, 2024Updated last year
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Languβ¦β88Dec 12, 2025Updated 6 months ago
- My personal research notebook with notes, tutorials, and resources written in Jupyterbook.β21Updated this week
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β116Sep 26, 2024Updated last year
- Data Valuation without Training of a Model, submitted to ICLR'23β22Dec 30, 2022Updated 3 years ago
- The test set for Koalaβ45Mar 31, 2023Updated 3 years ago
- θθιθθͺηΆθ―θ¨ε€ηη«θ΅γβ10Sep 3, 2018Updated 7 years ago
- Code for blog post on r-squaredβ13Jul 25, 2016Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Inverse Scaling in Test-Time Computeβ25Dec 3, 2025Updated 6 months ago
- Time-ordered UUIDv4β20Jun 10, 2024Updated 2 years ago
- Emacs minor mode for entering unicode math symbolsβ11Dec 10, 2023Updated 2 years ago
- β24Dec 8, 2024Updated last year
- β44Nov 17, 2024Updated last year
- β15Nov 22, 2023Updated 2 years ago
- β12Jun 18, 2024Updated last year