Aioli: A unified optimization framework for language model data mixing
☆32Jan 17, 2025Updated last year
Alternatives and similar repositories for aioli
Users that are interested in aioli are comparing it to the libraries listed below
Sorting:
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Aug 19, 2024Updated last year
- ☆51Jan 24, 2024Updated 2 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Oct 31, 2023Updated 2 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- ☆17Mar 23, 2025Updated 11 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆79Nov 14, 2024Updated last year
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated 10 months ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- Generative Modeling with Bayesian Sample Inference☆24May 17, 2025Updated 9 months ago
- Official repository of Wavehax vocoder☆66Dec 20, 2025Updated 2 months ago
- ☆26Jun 10, 2024Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆78May 2, 2025Updated 10 months ago
- ☆37Sep 21, 2025Updated 5 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆37Feb 22, 2025Updated last year
- The repository contains code for Adaptive Data Optimization☆32Dec 9, 2024Updated last year
- This tool allows local LLM usage that can automate tasks without human interventention. The agent can call itself recursively and work on…☆20May 5, 2025Updated 10 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Feb 23, 2026Updated last week
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- ☆32Feb 11, 2025Updated last year
- Aalto scientific computing guide: former Triton user guide + more info☆33Feb 19, 2026Updated 2 weeks ago
- Neural topic modeling☆29Aug 19, 2020Updated 5 years ago
- ☆38Nov 13, 2025Updated 3 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- GPI-Space: Memory Driven Computing and Big Data☆10Jan 2, 2025Updated last year
- Exploratory Data Analysis of Time Series Data and Forecasting using Naïve Approach, Moving Average Method, Simple Exponential Smoothenin…☆12Jul 2, 2018Updated 7 years ago
- ☆13Nov 21, 2025Updated 3 months ago
- Self-evaluating RAG application on LangCheck docs☆11Sep 10, 2025Updated 5 months ago
- Generative and Parametric design code: featuring Processing / Python / Javascript / HTML / CSS☆14Nov 4, 2020Updated 5 years ago
- ☆16Jul 7, 2025Updated 7 months ago
- ☆37Dec 19, 2024Updated last year
- Реализация sklearn-based Transformer-а для Weight of Evidence преобразования☆10May 6, 2020Updated 5 years ago
- 🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper☆39Jun 28, 2025Updated 8 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆39Jul 18, 2025Updated 7 months ago
- Watsonx Assistant with Milvus as Vector Database☆12Mar 31, 2025Updated 11 months ago
- Program uses cv2 to display many streams from cameras, web pages, local files☆13Jan 31, 2021Updated 5 years ago