alon-albalak/online-data-mixing

An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.

☆14

Alternatives and similar repositories for online-data-mixing

Users that are interested in online-data-mixing are comparing it to the libraries listed below.

yegcjs / mixinglaws
☆113 · Jul 15, 2025 · Updated last year
davidbrandfonbrener / color-filter-olmo
☆13 · Dec 12, 2025 · Updated 7 months ago
y0mingzhang / diffuse-distributions
Forcing Diffuse Distributions out of Language Models
☆18 · Sep 10, 2024 · Updated last year
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆37 · Dec 9, 2024 · Updated last year
feiyang-k / AutoScale
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆14 · Aug 8, 2025 · Updated 11 months ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆40 · Dec 27, 2022 · Updated 3 years ago
circle-hit / MuCDN
Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…
☆10 · Jul 21, 2023 · Updated 3 years ago
alon-albalak / data-selection-survey
A Survey on Data Selection for Language Models
☆261 · Apr 29, 2025 · Updated last year
kwaipilot / SWE-Compass
☆18 · Mar 28, 2026 · Updated 4 months ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆55 · Mar 4, 2025 · Updated last year
SCIR-SC-Qiaoban-Team / FreeEvalLM
[AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…
☆11 · Feb 7, 2026 · Updated 5 months ago
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆48 · Oct 31, 2023 · Updated 2 years ago
allenai / hybrid-preferences
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29 · Jul 23, 2025 · Updated last year
LuoXiaoHeics / Continual-Tune
☆10 · Feb 6, 2025 · Updated last year
Linestro / GRACE
Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference
☆18 · Mar 5, 2023 · Updated 3 years ago
chenllliang / ParetoMNMT
Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023
☆17 · Sep 27, 2023 · Updated 2 years ago
circle-hit / Lens
Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"
☆12 · Oct 15, 2024 · Updated last year
AIYA-ai / tongue_diagnosis_system
This project mainly uses DeepLabv3+ and YOLOv7 to recognize tongue images, and also includes a knowledge-graph-based multi-turn dialogue system
☆10 · Oct 12, 2024 · Updated last year
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆31 · Dec 8, 2023 · Updated 2 years ago
hkust-nlp / PEM_composition
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
☆61 · Nov 26, 2023 · Updated 2 years ago
pietrolesci / memorisation-profiles
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆25 · Mar 25, 2025 · Updated last year
Raincleared-Song / ConPET
Source code for a LoRA-based continual relation extraction method.
☆14 · Sep 25, 2023 · Updated 2 years ago
YuhanLiu11 / AutoFreeze
☆22 · Nov 20, 2020 · Updated 5 years ago
circle-hit / KBCIN
Code for AAAI 2023 accepted paper titled "Knowledge-Bridged Causal Interaction Network for Causal Emotion Entailment"
☆14 · May 6, 2023 · Updated 3 years ago
hitz-zentroa / lm-contamination
The LM Contamination Index is a manually created database of contamination evidence for LMs.
☆81 · Apr 11, 2024 · Updated 2 years ago
zs1314 / Fraesormer
[ICME2025 Oral] Official PyTorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"
☆13 · Mar 21, 2025 · Updated last year
MinkaiXu / fPO
f-PO: Generalizing Preference Optimization with f-divergence Minimization
☆14 · Apr 2, 2025 · Updated last year
pipixia-labs / creative-claw
☆37 · Jul 9, 2026 · Updated 2 weeks ago
zhuzilin / vllm-group
☆12 · Nov 5, 2024 · Updated last year
ShiyuNee / Awesome-Calibration-Papers
A curated list of awesome papers about calibration
☆15 · May 6, 2024 · Updated 2 years ago
Dicer-Zz / EPI
Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation
☆13 · May 16, 2023 · Updated 3 years ago
qiuzh20 / EMoE
Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]
☆39 · May 28, 2024 · Updated 2 years ago
pengwei-iie / GLHG
☆12 · Mar 12, 2024 · Updated 2 years ago
Jianguo99 / OpenDLM
OpenDLM is an open-source library focused on sampling algorithms for Diffusion Language Models (DLMs).
☆15 · Aug 5, 2025 · Updated 11 months ago
LCS2-IIITD / Emotion-Flip-Reasoning
☆14 · Aug 7, 2023 · Updated 2 years ago
supersupercong / MSGNN
[IJCAI-24] Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks
☆11 · Sep 2, 2024 · Updated last year
Kroangine-Xia / Design-of-a-Gesture-Recognition-based-Robotic-Arm-Control-System
☆15 · Jun 19, 2024 · Updated 2 years ago
cxcscmu / MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
☆80 · Nov 14, 2024 · Updated last year
circle-hit / TransESC
Code for ACL 2023 (Findings) accepted paper titled "TransESC: Smoothing Emotional Support Conversation via Turn-Level State Transition"
☆16 · Jun 16, 2023 · Updated 3 years ago