Pytorch Implementation of "Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models", AAAI 2025
☆38Feb 4, 2026Updated 3 weeks ago
Alternatives and similar repositories for Multi-Level-OT
Users that are interested in Multi-Level-OT are comparing it to the libraries listed below
Sorting:
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆61Aug 26, 2025Updated 6 months ago
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆28Nov 25, 2025Updated 3 months ago
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence☆42Aug 8, 2025Updated 6 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆40Feb 13, 2025Updated last year
- Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)☆40Aug 28, 2023Updated 2 years ago
- ☆16Jun 8, 2023Updated 2 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- [AAAI 2023] IterDE: An Iterative Knowledge Distillation Framework for Knowledge Graph Embeddings☆10Apr 3, 2024Updated last year
- [MICCAI 2024] Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation☆12Jun 30, 2024Updated last year
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago
- ☆10Feb 3, 2025Updated last year
- An official repository for GPTailor☆17Jun 29, 2025Updated 8 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Deep learning techniques for atherosclerotic plaque detection in the carotid artery☆15Jun 16, 2022Updated 3 years ago
- A small demo for training cnn with pytorch.☆11Dec 15, 2018Updated 7 years ago
- nicolive protobuf module for TypeScript☆12Feb 2, 2026Updated last month
- Graph algorithms to merge two graphs based on stitching.☆12Oct 18, 2019Updated 6 years ago
- NODE-SELECT: A Graph Neural Network Based On A Selective Propagation Technique☆21May 4, 2022Updated 3 years ago
- Package to align tokens from different tokenizations.☆16Mar 25, 2024Updated last year
- Uses C-GAN for feature hallucination of missing modalities for hyperspectral data. TensorFlow implementation of ICCV '19 paper☆11Sep 9, 2020Updated 5 years ago
- Learning Multi-Attention Convolutional Neural Network for Fine-GrainedImage Recognition☆12May 28, 2021Updated 4 years ago
- [MICCAI'22] Unsupervised Contrastive Learning on Gall Bladder Ultrasound Videos☆11May 28, 2023Updated 2 years ago
- [NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…☆10Feb 13, 2022Updated 4 years ago
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆74Jul 14, 2025Updated 7 months ago
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation☆16Mar 28, 2023Updated 2 years ago
- [NeurIPS 2025 Spotlight] A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.☆45Oct 29, 2025Updated 4 months ago
- Converting JSON of COCO dataset into XML of PASCAL VOC☆11May 6, 2019Updated 6 years ago
- DSP Program written in Matlab to detect falling objects with a companion Web UI☆13Jun 14, 2015Updated 10 years ago
- Implementation of several knowledge distillation techniques on PyTorch☆15Feb 25, 2019Updated 7 years ago
- ☆14Mar 31, 2022Updated 3 years ago
- ☆14Oct 30, 2023Updated 2 years ago
- ☆11Nov 23, 2020Updated 5 years ago
- Disentangling Factors of Variation by Mixing Them codes☆16Mar 13, 2019Updated 6 years ago
- Action Rules Mining☆16Apr 22, 2025Updated 10 months ago
- ☆27Updated this week
- Adversarial Images for Variational Autoencoders☆13Nov 30, 2016Updated 9 years ago
- ☆15Apr 19, 2021Updated 4 years ago
- https://haa.boyuai.com☆49Dec 8, 2025Updated 2 months ago
- This project is a research on how to extract rules from the existing data using trained Decision Tree. The dataset used to train the mode…☆16Jun 12, 2019Updated 6 years ago