Aioli: A unified optimization framework for language model data mixing
☆32 · Jan 17, 2025 · Updated last year
Alternatives and similar repositories for aioli
Users that are interested in aioli are comparing it to the libraries listed below.
- Official Code Repository for [AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*… ☆13 · Aug 8, 2025 · Updated 7 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆48 · Oct 31, 2023 · Updated 2 years ago
- A Controllable Model of Grounded Response Generation (AAAI 21) ☆13 · Oct 25, 2022 · Updated 3 years ago
- ☆51 · Jan 24, 2024 · Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ☆79 · May 2, 2025 · Updated 10 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024] ☆79 · Nov 14, 2024 · Updated last year
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning" ☆10 · Nov 15, 2024 · Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing ☆14 · Feb 10, 2023 · Updated 3 years ago
- ☆10 · Jul 7, 2025 · Updated 8 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 · Feb 22, 2024 · Updated 2 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆60 · Jul 23, 2024 · Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call ☆20 · Aug 19, 2024 · Updated last year
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection" ☆13 · Oct 28, 2024 · Updated last year
- ☆33 · Feb 11, 2025 · Updated last year
- The repository contains code for Adaptive Data Optimization ☆33 · Dec 9, 2024 · Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Apple ☆19 · Nov 18, 2024 · Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?" ☆39 · Jul 18, 2025 · Updated 8 months ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models". ☆18 · Apr 25, 2025 · Updated 11 months ago
- Resources and code for paper "Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning" ☆24 · Jun 14, 2022 · Updated 3 years ago
- Exploration of automated dataset selection approaches at large scales. ☆52 · Mar 4, 2025 · Updated last year
- ☆33 · Jun 24, 2024 · Updated last year
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu… ☆88 · Dec 12, 2025 · Updated 3 months ago
- Unofficial Implementation of Evolutionary Model Merging ☆41 · Mar 28, 2024 · Updated last year
- GOPHI: an AMR-to-English Verbalizer ☆11 · Feb 5, 2020 · Updated 6 years ago
- [NeurIPS-2024] Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆89 · Sep 26, 2024 · Updated last year
- Data Valuation without Training of a Model, submitted to ICLR'23 ☆22 · Dec 30, 2022 · Updated 3 years ago
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models" ☆21 · Oct 23, 2024 · Updated last year
- A smattering of header files dumped using classdump-dyld ☆13 · Apr 28, 2021 · Updated 4 years ago
- ☆25 · Sep 3, 2025 · Updated 6 months ago
- Ant Financial natural language processing competition. ☆10 · Sep 3, 2018 · Updated 7 years ago
- Time-ordered UUIDv4 ☆20 · Jun 10, 2024 · Updated last year
- Emacs minor mode for entering unicode math symbols ☆11 · Dec 10, 2023 · Updated 2 years ago
- ☆15 · Nov 22, 2023 · Updated 2 years ago
- ☆44 · Nov 17, 2024 · Updated last year
- ☆24 · Dec 8, 2024 · Updated last year
- ☆12 · Jun 18, 2024 · Updated last year
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le… ☆13 · Jan 16, 2025 · Updated last year
- An exploration of LLM steering ☆25 · Jun 15, 2024 · Updated last year
- Knowledge Graph based Question Answering benchmark. ☆10 · Feb 1, 2020 · Updated 6 years ago