2018cx/Multi-Level-OT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/2018cx/Multi-Level-OT)

2018cx / Multi-Level-OT

Pytorch Implementation of "Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models", AAAI 2025

☆38

Alternatives and similar repositories for Multi-Level-OT

Users that are interested in Multi-Level-OT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

songmzhang / DSKD
View on GitHub
Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…
☆63Mar 21, 2026Updated 4 months ago
AMAP-EAI / Nav-R2
View on GitHub
Official Implementation of paper: [Nav-R2:Dual‑Relation Reasoning for Generalizable Open‑Vocabulary Object‑Goal Navigation]
☆20Dec 10, 2025Updated 7 months ago
Nicolas-BZRD / llm-recipes
View on GitHub
☆33Mar 13, 2024Updated 2 years ago
2018cx / SinKD
View on GitHub
Pytorch Implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024
☆130Apr 27, 2025Updated last year
Muennighoff / FLAN
View on GitHub
Provides a minimal implementation to extract FLAN datasets for further processing
☆11Feb 1, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
jongwooko / distillm
View on GitHub
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
☆266Mar 13, 2025Updated last year
nhduong1203 / Chatbot
View on GitHub
☆11Mar 30, 2025Updated last year
yang3121099 / LLM-Neo
View on GitHub
The code for paper "LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models"
☆15Mar 2, 2025Updated last year
Nicolas-BZRD / llm-distillation
View on GitHub
☆11Feb 3, 2025Updated last year
philschmid / knowledge-distillation-transformers-pytorch-sagemaker
View on GitHub
☆47Feb 1, 2022Updated 4 years ago
mcneela / Sobolev
View on GitHub
Implementation of DeepMind's "Sobolev Training for Neural Networks"
☆11Apr 2, 2018Updated 8 years ago
YTianZHU / verl
View on GitHub
☆16Dec 22, 2025Updated 7 months ago
theavicaster / featurehallucination-cgan
View on GitHub
Uses C-GAN for feature hallucination of missing modalities for hyperspectral data. TensorFlow implementation of ICCV '19 paper
☆11Sep 9, 2020Updated 5 years ago
dr-mushtaq / Projects
View on GitHub
This repository is related to all about Machine Learning, Deep Learning, Computer Vision, NLP, and Research Projects
☆16Jun 7, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
cliang1453 / task-aware-distillation
View on GitHub
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
☆40Aug 28, 2023Updated 2 years ago
moskomule / distillation.pytorch
View on GitHub
Implementation of several knowledge distillation techniques on PyTorch
☆15Feb 25, 2019Updated 7 years ago
safakozdek / Color-Quantization
View on GitHub
Color quantization is the process of reducing number of colors used in an image while trying to maintain the visual appearance of the ori…
☆19Jul 25, 2021Updated 5 years ago
nghiangh / OpenViVQA
View on GitHub
This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…
☆15Dec 31, 2024Updated last year
zjunlp / PitfallsKnowledgeEditing
View on GitHub
[ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models
☆22Jun 13, 2024Updated 2 years ago
hangwu2021 / 360SurroundView
View on GitHub
☆20Sep 24, 2022Updated 3 years ago
Zzzzz1 / CSKD
View on GitHub
Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…
☆15Nov 5, 2023Updated 2 years ago
vcl-iisc / ZSKD
View on GitHub
Zero-Shot Knowledge Distillation in Deep Networks
☆67Apr 16, 2022Updated 4 years ago
Magic-chao / rssrai2019_scene_classification
View on GitHub
Scene classification baseline. Test Acc:90.14%
☆16Jul 9, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
neda77aa / FTC
View on GitHub
This repo holds the code for: {Transformer-based Spatio-temporal Analysis for Automatic Classification of Aortic Stenosis Severity from B…
☆12Nov 29, 2022Updated 3 years ago
jasonseu / MultiLabelClassification
View on GitHub
a codebase for multi label classification with PyTorch.
☆15Nov 23, 2022Updated 3 years ago
anonymouscvpr1983 / GAL
View on GitHub
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning
☆18Mar 23, 2019Updated 7 years ago
open-evals / evals
View on GitHub
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
☆18Mar 23, 2023Updated 3 years ago
iLearn-Lab / ACL25-PTQ1.61
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
qnguyen3 / nanoLLaVA
View on GitHub
World's Smallest Vision-Language Model
☆35Apr 7, 2024Updated 2 years ago
abonte / protopdebug
View on GitHub
Implementation of Concept-level Debugging of Part-Prototype Networks
☆12May 9, 2023Updated 3 years ago
hhhh1138 / VDOT
View on GitHub
[CVPR 2026] VDOT: Efficient Unified Video Creation via Optimal Transport Distillation
☆18Mar 16, 2026Updated 4 months ago
VITA-Group / Nasty-Teacher
View on GitHub
[ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…
☆83Dec 30, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
gbc-iitd / US_UCL
View on GitHub
[MICCAI'22] Unsupervised Contrastive Learning on Gall Bladder Ultrasound Videos
☆11May 28, 2023Updated 3 years ago
DelTA-Lab-IITK / shad3s
View on GitHub
☆14Mar 31, 2022Updated 4 years ago
bangoc123 / multi-agents-design-patterns
View on GitHub
Multi-agent design pattern and agent evaluation process
☆23Mar 27, 2025Updated last year
sail-sg / LongSpec
View on GitHub
[ACL 2026 (Main)] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
☆84Jul 14, 2025Updated last year
WongiPark0628 / RAL
View on GitHub
[ICCVW'23] Robust Asymmetric Loss for Multi-Label Long-Tailed Learning
☆19Oct 3, 2023Updated 2 years ago
buptrabbit / Information-Gain
View on GitHub
This is a python-based program which implements Information-Gain Algorithm used for Text Features Selection
☆11Sep 9, 2014Updated 11 years ago
DennisLeoUTS / improved-bilinear-pooling
View on GitHub
This is pytorch implementation of bilinear pooling and its matrix normalized version, iBCNN.
☆16Jun 4, 2019Updated 7 years ago