Aioli: A unified optimization framework for language model data mixing
β32Jan 17, 2025Updated last year
Alternatives and similar repositories for aioli
Users that are interested in aioli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β14Aug 8, 2025Updated 8 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Modelsβ48Oct 31, 2023Updated 2 years ago
- β53Jan 24, 2024Updated 2 years ago
- β17Mar 23, 2025Updated last year
- Codebase for Instruction Following without Instruction Tuningβ36Sep 24, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothingβ13Dec 6, 2022Updated 3 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]β79Nov 14, 2024Updated last year
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"β11Nov 15, 2024Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Feb 22, 2024Updated 2 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ60Jul 23, 2024Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)β27Feb 25, 2025Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsβ202Dec 8, 2025Updated 4 months ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function callβ20Aug 19, 2024Updated last year
- β16Nov 30, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"β13Oct 28, 2024Updated last year
- β33Feb 11, 2025Updated last year
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policiesβ30Aug 14, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- The repository contains code for Adaptive Data Optimizationβ36Dec 9, 2024Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Appleβ19Nov 18, 2024Updated last year
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".β18Apr 25, 2025Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"β40Jul 18, 2025Updated 9 months ago
- The original Shared Recurrent Memory Transformer implementationβ35Jul 11, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [Preprint] Graph State Space Convolution (GSSC)β14Jun 11, 2024Updated last year
- AI_Powered_Dev_Search_Engineβ12Mar 10, 2024Updated 2 years ago
- Exploration of automated dataset selection approaches at large scales.β54Mar 4, 2025Updated last year
- My personal research notebook with notes, tutorials, and resources written in Jupyterbook.β21Updated this week
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Languβ¦β88Dec 12, 2025Updated 4 months ago
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β118Sep 26, 2024Updated last year
- The test set for Koalaβ45Mar 31, 2023Updated 3 years ago
- An implementation of unsupervised example of the Forward-Forward algorithm proposed by (Hinton, 2022)β10Jun 19, 2024Updated last year
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"β21Oct 23, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- θθιθθͺηΆθ―θ¨ε€ηη«θ΅γβ10Sep 3, 2018Updated 7 years ago
- Code for blog post on r-squaredβ13Jul 25, 2016Updated 9 years ago
- Time-ordered UUIDv4β20Jun 10, 2024Updated last year
- Inverse Scaling in Test-Time Computeβ25Dec 3, 2025Updated 5 months ago
- β15Mar 3, 2025Updated last year
- β44Nov 17, 2024Updated last year
- β15Nov 22, 2023Updated 2 years ago