golololologol/LLM-Distillery

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/golololologol/LLM-Distillery)

golololologol / LLM-Distillery

A pipeline for LLM knowledge distillation

☆116

Alternatives and similar repositories for LLM-Distillery

Users that are interested in LLM-Distillery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

arcee-ai / DistillKit
View on GitHub
An Open Source Toolkit For LLM Distillation
☆991May 12, 2026Updated 2 months ago
jongwooko / distillm
View on GitHub
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
☆267Mar 13, 2025Updated last year
SinatrasC / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆17Oct 9, 2024Updated last year
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
Tebmer / Awesome-Knowledge-Distillation-of-LLMs
View on GitHub
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit…
☆1,296Mar 9, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
lovit / kowikitext
View on GitHub
☆19Jan 17, 2021Updated 5 years ago
CarperAI / treasure_trove
View on GitHub
☆21Aug 27, 2023Updated 2 years ago
wutaiqiang / LLM_KD_AKL
View on GitHub
☆22Oct 22, 2024Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
View on GitHub
☆47Aug 29, 2024Updated last year
plm-team / PLM
View on GitHub
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
☆21Mar 18, 2025Updated last year
catid / lllm
View on GitHub
Latent Large Language Models
☆19Aug 24, 2024Updated last year
JD-P / RetroInstruct
View on GitHub
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆34Oct 8, 2025Updated 9 months ago
axeld5 / pali_reason
View on GitHub
Testing paligemma2 finetuning on reasoning dataset
☆18Dec 28, 2024Updated last year
horus-ai-labs / DistillFlow
View on GitHub
Library for model distillation
☆169Sep 6, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
migtissera / Sensei
View on GitHub
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆221Apr 29, 2024Updated 2 years ago
agokrani / distillKitPlus
View on GitHub
Easy to use, High Performant Knowledge Distillation for LLMs
☆98May 5, 2025Updated last year
Pints-AI / 1.5-Pints
View on GitHub
A compact LLM pretrained in 9 days by using high quality data
☆342Apr 9, 2025Updated last year
jxqu3 / aiui
View on GitHub
A simple no-install web UI for Ollama and OAI-Compatible APIs!
☆31Jan 30, 2025Updated last year
FeiyuZhang98 / IncreLoRA
View on GitHub
☆36Aug 23, 2023Updated 2 years ago
kyegomez / MultiQueryAttention
View on GitHub
This is a simple torch implementation of the high performance Multi-Query Attention
☆16Aug 23, 2023Updated 2 years ago
cloneofsimo / minSAE
View on GitHub
☆30Dec 2, 2024Updated last year
philippe-eecs / vitok
View on GitHub
☆34May 14, 2025Updated last year
main-horse / hnet-old
View on GitHub
H-Net Dynamic Hierarchical Architecture
☆81Sep 11, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
JD-P / minihf
View on GitHub
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆184Nov 6, 2025Updated 8 months ago
rosmineb / unit_test_rl
View on GitHub
Project code for training LLMs to write better unit tests + code
☆22May 19, 2025Updated last year
Percent-BFD / neurips_submission
View on GitHub
☆17Nov 23, 2023Updated 2 years ago
joey00072 / ohara
View on GitHub
Collection of autoregressive model implementation
☆84Jun 10, 2026Updated last month
JacksonCakes / vision-r1
View on GitHub
☆13Mar 23, 2025Updated last year
zhourunlong / Reflect-RL
View on GitHub
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
☆18Jul 19, 2025Updated last year
Nardien / agent-distillation
View on GitHub
Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"
☆250Oct 22, 2025Updated 9 months ago
FasterDecoding / TEAL
View on GitHub
☆168Feb 15, 2025Updated last year
bbartoldson / TBA
View on GitHub
Official implementation of TBA for async LLM post-training.
☆32Nov 5, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
menhguin / minp_paper
View on GitHub
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆51Aug 13, 2025Updated 11 months ago
liutianlin0121 / decoding-time-realignment
View on GitHub
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21Jun 17, 2024Updated 2 years ago
Phoenix8215 / build_neural_network_from_scratch_CPP
View on GitHub
Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.
☆11Jul 27, 2024Updated 2 years ago
ramsrigouthamg / BERT_generate_grammar_MCQ_from_news_article
View on GitHub
Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.
☆13Oct 2, 2019Updated 6 years ago
brendanhogan / DeepSeekRL-Extended
View on GitHub
Exploring Applications of GRPO
☆252Aug 25, 2025Updated 11 months ago
QuixiAI / spectrum
View on GitHub
☆145Aug 20, 2025Updated 11 months ago
xjdr-alt / muzero_sketch
View on GitHub
☆40Jul 26, 2024Updated 2 years ago