PyTorch Implementation of "Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models", AAAI 2025
☆38 · Feb 4, 2026 · Updated last month
Alternatives and similar repositories for Multi-Level-OT
Users who are interested in Multi-Level-OT are comparing it to the repositories listed below.
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same… ☆61 · Mar 7, 2026 · Updated 2 weeks ago
- ☆31 · Mar 13, 2024 · Updated 2 years ago
- Official code for 'TraM: Enhancing User Sleep Prediction with Transformer-Based Multivariate Time Series Modeling and Machine Learning En… ☆19 · Jul 3, 2024 · Updated last year
- ☆21 · Jul 9, 2025 · Updated 8 months ago
- PyTorch Implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024 ☆120 · Apr 27, 2025 · Updated 10 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing ☆11 · Feb 1, 2023 · Updated 3 years ago
- Fine-tune GPT2 to generate fake job experiences ☆11 · Jan 17, 2023 · Updated 3 years ago
- ☆11 · Feb 3, 2025 · Updated last year
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton ☆44 · Feb 13, 2025 · Updated last year
- ☆48 · Feb 1, 2022 · Updated 4 years ago
- A small demo for training a CNN with PyTorch. ☆11 · Dec 15, 2018 · Updated 7 years ago
- [ICLR 2025] Official repository for the paper "Influence-Guided Diffusion for Dataset Distillation". ☆15 · Feb 12, 2025 · Updated last year
- Free Chrome extension to summarize articles on the web using ChatGPT AI ☆18 · Jan 7, 2023 · Updated 3 years ago
- ☆14 · Sep 9, 2024 · Updated last year
- Uses C-GAN for feature hallucination of missing modalities for hyperspectral data. TensorFlow implementation of ICCV '19 paper ☆11 · Sep 9, 2020 · Updated 5 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli… ☆12 · Nov 18, 2022 · Updated 3 years ago
- Implementation of several knowledge distillation techniques on PyTorch ☆15 · Feb 25, 2019 · Updated 7 years ago
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence ☆44 · Aug 8, 2025 · Updated 7 months ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models ☆22 · Jun 13, 2024 · Updated last year
- Role-Wise Data Augmentation for Knowledge Distillation ☆19 · Nov 22, 2022 · Updated 3 years ago
- IPython notebooks of walk-through Transformer model implementations in PyTorch and GPT-2 fine-tuning. ☆24 · Jan 9, 2020 · Updated 6 years ago
- [AAAI 2023] IterDE: An Iterative Knowledge Distillation Framework for Knowledge Graph Embeddings ☆10 · Apr 3, 2024 · Updated last year
- Zero-Shot Knowledge Distillation in Deep Networks ☆67 · Apr 16, 2022 · Updated 3 years ago
- A codebase for multi-label classification with PyTorch. ☆15 · Nov 23, 2022 · Updated 3 years ago
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks. ☆18 · Mar 23, 2023 · Updated 2 years ago
- Deep learning techniques for atherosclerotic plaque detection in the carotid artery ☆15 · Jun 16, 2022 · Updated 3 years ago
- Towards Optimal Structured CNN Pruning via Generative Adversarial Learning ☆18 · Mar 23, 2019 · Updated 6 years ago
- [MICCAI 2024] Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation ☆12 · Jun 30, 2024 · Updated last year
- Implementation of the paper "Implicit Feature Refinement for Instance Segmentation". ☆20 · Oct 27, 2021 · Updated 4 years ago
- GitHub repository for KDD 2021 work: ProtoPShare: Prototypical Parts Sharing for Similarity Discovery in Interpretable Image Classificati… ☆14 · May 30, 2021 · Updated 4 years ago
- A simple, micro Chinese weather-query demo (Weatherbot) built on the rasa 1.x framework ☆12 · Aug 14, 2019 · Updated 6 years ago
- Implementation of Concept-level Debugging of Part-Prototype Networks ☆12 · May 9, 2023 · Updated 2 years ago
- Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition ☆12 · May 28, 2021 · Updated 4 years ago
- [MICCAI'22] Unsupervised Contrastive Learning on Gall Bladder Ultrasound Videos ☆11 · May 28, 2023 · Updated 2 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che… ☆83 · Dec 30, 2021 · Updated 4 years ago
- Graph algorithms to merge two graphs based on stitching. ☆12 · Oct 18, 2019 · Updated 6 years ago
- ☆14 · Mar 31, 2022 · Updated 3 years ago
- [ICCVW'23] Robust Asymmetric Loss for Multi-Label Long-Tailed Learning ☆18 · Oct 3, 2023 · Updated 2 years ago
- Implementation of Differentiable ODE Solvers using Jittor ☆13 · May 18, 2025 · Updated 10 months ago