apple/ml-dataset-decomposition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apple/ml-dataset-decomposition)

apple / ml-dataset-decomposition

Official repo of dataset-decomposition paper [NeurIPS 2024]

☆21

Alternatives and similar repositories for ml-dataset-decomposition

Users that are interested in ml-dataset-decomposition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

buyi-Yang / getQzonehistory
View on GitHub
☆12Nov 13, 2024Updated last year
tiremoscode / dw-grupo58
View on GitHub
☆20Nov 28, 2024Updated last year
GovardhaneNitin / smart-inventory
View on GitHub
A smart inventory management system that includes real-time stock tracking, supplier management, predictive analytics for inventory forec…
☆16Apr 22, 2025Updated last year
kinesiatricssxilm14 / CodeRepoQA
View on GitHub
CodeRepoQA dataset
☆15Feb 19, 2025Updated last year
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AndreaGrandieri / ing-sw-2024-codex-naturalis
View on GitHub
Progetto per la prova finale di Ingegneria del Software 2023-2024 al Politecnico di Milano
☆10Oct 19, 2024Updated last year
dragonjsq / -VPN
View on GitHub
免费梯子，免费VPN，真正免费的的VPN，shadowsocks,v2rey,官网地址www.dragonvpn.cc
☆13Sep 4, 2024Updated last year
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year
sheryc / resonance_rope
View on GitHub
[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.
☆24Mar 5, 2024Updated 2 years ago
shawntan / stickbreaking-attention
View on GitHub
Stick-breaking attention
☆63Jul 1, 2025Updated last year
SDLAML / disco
View on GitHub
☆16Dec 11, 2025Updated 7 months ago
AllanYangZhou / midGPT
View on GitHub
Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.
☆27Sep 29, 2024Updated last year
google-deepmind / exedec
View on GitHub
☆14May 9, 2024Updated 2 years ago
jiah-li / magic
View on GitHub
The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.
☆15Dec 16, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
moayedellah / Network-Security
View on GitHub
A curated collection of courses, videos, and resources to master network security from the ground up.
☆11Jan 6, 2025Updated last year
DavidWBressler / adaptivesoftmax
View on GitHub
☆12Nov 25, 2018Updated 7 years ago
ShabanMughal / Robot-Ai
View on GitHub
☆22Jan 1, 2026Updated 6 months ago
okarthikb / state-space-models
View on GitHub
☆27Jul 9, 2024Updated 2 years ago
AgustinCoding / identity-alchemist
View on GitHub
Identity Alchemist: A powerful Python-based tool for generating and managing synthetic identities. Features machine learning integration,…
☆12Feb 12, 2025Updated last year
dream3d-ai / torch-submit
View on GitHub
☆10Dec 21, 2024Updated last year
gmlwns2000 / sea-attention
View on GitHub
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
☆12Jun 20, 2025Updated last year
timlautk / polargrad
View on GitHub
PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective
☆18Oct 1, 2025Updated 9 months ago
muellerzr / import-timer
View on GitHub
Pragmatic approach to parsing import profiles for CI's
☆12Jul 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bluvolve-dev / reactive-course-service-with-nextjs-ui-
View on GitHub
☆11Oct 15, 2020Updated 5 years ago
D2I-ai / Route
View on GitHub
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)
☆16May 15, 2025Updated last year
url-kaist / HeLiMOS-visualizer
View on GitHub
A LiDAR visualization tool for HeLiMOS dataset
☆26May 8, 2026Updated 2 months ago
KorAP / Tokenizer-Evaluation
View on GitHub
Benchmark scripts for comparing different tokenizers and sentence segmenters of German
☆12Feb 27, 2023Updated 3 years ago
WOWNICE / ssl-small
View on GitHub
Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".
☆17Dec 15, 2021Updated 4 years ago
princeton-pli / MeCo
View on GitHub
Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆51Jun 30, 2025Updated last year
haebin-seong / HarmAug
View on GitHub
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models
☆13Mar 6, 2025Updated last year
graphcore-research / jax-scalify
View on GitHub
JAX Scalify: end-to-end scaled arithmetics
☆18Oct 30, 2024Updated last year
teslamotors / LVCS
View on GitHub
LVCS@Tesla.com
☆13May 20, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JonasGeiping / dataaugs
View on GitHub
☆18Oct 12, 2022Updated 3 years ago
RUC-GSAI / YuLan-Mini
View on GitHub
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆232Jul 25, 2025Updated 11 months ago
polo5 / FDS
View on GitHub
Gradient-based Hyperparameter Optimization Over Long Horizons
☆14Sep 29, 2021Updated 4 years ago
monologg / py-backtrans
View on GitHub
Python library for backtranslation (with Google Translate)
☆12Jan 11, 2020Updated 6 years ago
rbiswasfc / llm-science-exam
View on GitHub
6th Position Solution Code for Kaggle - LLM Science Exam Competition
☆24Jul 8, 2024Updated 2 years ago
microsoft / jackknife-variational-inference
View on GitHub
Demonstration of Jackknife Variational Inference for Variational Autoencoders, related to ICLR 2018 paper.
☆22Feb 21, 2018Updated 8 years ago
cloneofsimo / zeroshampoo
View on GitHub
☆33Sep 10, 2024Updated last year