Aioli: A unified optimization framework for language model data mixing
β32Jan 17, 2025Updated last year
Alternatives and similar repositories for aioli
Users that are interested in aioli are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β13Aug 8, 2025Updated 8 months ago
- A Controllable Model of Grounded Response Generation (AAAI 21)β13Oct 25, 2022Updated 3 years ago
- β52Jan 24, 2024Updated 2 years ago
- β17Mar 23, 2025Updated last year
- Codebase for Instruction Following without Instruction Tuningβ36Sep 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothingβ13Dec 6, 2022Updated 3 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]β79Nov 14, 2024Updated last year
- β16Jul 29, 2025Updated 8 months ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typingβ14Feb 10, 2023Updated 3 years ago
- β10Jul 7, 2025Updated 9 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"β17Feb 22, 2024Updated 2 years ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ60Jul 23, 2024Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)β26Feb 25, 2025Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsβ201Dec 8, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Tool4AI: A model agnostic, LLM friendly router for tool/function callβ20Aug 19, 2024Updated last year
- β16Nov 30, 2022Updated 3 years ago
- [NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"β12Dec 20, 2024Updated last year
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"β13Oct 28, 2024Updated last year
- β33Feb 11, 2025Updated last year
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policiesβ30Aug 14, 2024Updated last year
- Generative Retrieval Transformerβ29Jul 23, 2023Updated 2 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- The repository contains code for Adaptive Data Optimizationβ35Dec 9, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The original Shared Recurrent Memory Transformer implementationβ33Jul 11, 2025Updated 9 months ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".β18Apr 25, 2025Updated 11 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"β40Jul 18, 2025Updated 8 months ago
- Exploration of automated dataset selection approaches at large scales.β53Mar 4, 2025Updated last year
- β33Jun 24, 2024Updated last year
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Languβ¦β87Dec 12, 2025Updated 4 months ago
- GOPHI: an AMR-to-English Verbalizerβ11Feb 5, 2020Updated 6 years ago
- The test set for Koalaβ45Mar 31, 2023Updated 3 years ago
- Data Valuation without Training of a Model, submitted to ICLR'23β22Dec 30, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"β21Oct 23, 2024Updated last year
- A smattering of header files dumped using classdump-dyldβ14Apr 28, 2021Updated 4 years ago
- θθιθθͺηΆθ―θ¨ε€ηη«θ΅γβ10Sep 3, 2018Updated 7 years ago
- Time-ordered UUIDv4β20Jun 10, 2024Updated last year
- β14Mar 3, 2025Updated last year
- Inverse Scaling in Test-Time Computeβ25Dec 3, 2025Updated 4 months ago
- Emacs minor mode for entering unicode math symbolsβ11Dec 10, 2023Updated 2 years ago