Aioli: A unified optimization framework for language model data mixing
β32Jan 17, 2025Updated last year
Alternatives and similar repositories for aioli
Users that are interested in aioli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β14Aug 8, 2025Updated 10 months ago
- β53Jan 24, 2024Updated 2 years ago
- β20Jun 27, 2026Updated last week
- Codebase for Instruction Following without Instruction Tuningβ36Sep 24, 2024Updated last year
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothingβ13Dec 6, 2022Updated 3 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curationβ82May 2, 2025Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]β80Nov 14, 2024Updated last year
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typingβ14Feb 10, 2023Updated 3 years ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Feb 22, 2024Updated 2 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ60Jul 23, 2024Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)β27Feb 25, 2025Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsβ204Dec 8, 2025Updated 6 months ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function callβ20Aug 19, 2024Updated last year
- β16Nov 30, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β33Feb 11, 2025Updated last year
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policiesβ30Aug 14, 2024Updated last year
- Generative Retrieval Transformerβ30Jul 23, 2023Updated 2 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β33Jan 23, 2025Updated last year
- The repository contains code for Adaptive Data Optimizationβ36Dec 9, 2024Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Appleβ19Nov 18, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementationβ36Jul 11, 2025Updated 11 months ago
- Generative Modeling with Bayesian Sample Inferenceβ24May 17, 2025Updated last year
- Resources and code for paper "Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning"β24Jun 14, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- AI_Powered_Dev_Search_Engineβ12Mar 10, 2024Updated 2 years ago
- Exploration of automated dataset selection approaches at large scales.β55Mar 4, 2025Updated last year
- β33Jun 24, 2024Updated 2 years ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Languβ¦β89Dec 12, 2025Updated 6 months ago
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β115Sep 26, 2024Updated last year
- The test set for Koalaβ45Mar 31, 2023Updated 3 years ago
- An implementation of unsupervised example of the Forward-Forward algorithm proposed by (Hinton, 2022)β10Jun 19, 2024Updated 2 years ago
- θθιθθͺηΆθ―θ¨ε€ηη«θ΅γβ10Sep 3, 2018Updated 7 years ago
- Code for blog post on r-squaredβ13Jul 25, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Inverse Scaling in Test-Time Computeβ25Dec 3, 2025Updated 7 months ago
- Time-ordered UUIDv4β20Jun 10, 2024Updated 2 years ago
- Emacs minor mode for entering unicode math symbolsβ11Dec 10, 2023Updated 2 years ago
- β24Dec 8, 2024Updated last year
- β44Nov 17, 2024Updated last year
- β15Nov 22, 2023Updated 2 years ago
- β12Jun 18, 2024Updated 2 years ago