Pytorch Implementation of "Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models", AAAI 2025
☆38Feb 4, 2026Updated 3 months ago
Alternatives and similar repositories for Multi-Level-OT
Users that are interested in Multi-Level-OT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆63Mar 21, 2026Updated 2 months ago
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆24Nov 25, 2025Updated 5 months ago
- ☆33Mar 13, 2024Updated 2 years ago
- ☆22Jul 9, 2025Updated 10 months ago
- PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation☆16Mar 28, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆262Mar 13, 2025Updated last year
- Fine-tune GPT2 to generate fake job experiences☆12Jan 17, 2023Updated 3 years ago
- ☆11Feb 3, 2025Updated last year
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆49May 12, 2026Updated last week
- Implementation of DeepMind's "Sobolev Training for Neural Networks"☆11Apr 2, 2018Updated 8 years ago
- ☆38Aug 18, 2025Updated 9 months ago
- [ACL 2026 (Main)] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆82Jul 14, 2025Updated 10 months ago
- A classic five stage pipelined processor☆13Mar 13, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Sep 9, 2024Updated last year
- Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)☆40Aug 28, 2023Updated 2 years ago
- Uses C-GAN for feature hallucination of missing modalities for hyperspectral data. TensorFlow implementation of ICCV '19 paper☆11Sep 9, 2020Updated 5 years ago
- [EMNLP'25 main] This is the official repo for the paper, Can LLMs be Good Graph Judge for Knowledge Graph Construction?☆28Sep 23, 2025Updated 7 months ago
- Implementation of several knowledge distillation techniques on PyTorch☆15Feb 25, 2019Updated 7 years ago
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence☆45Aug 8, 2025Updated 9 months ago
- ResNet-50 for TsinghuaDog classification☆10Feb 2, 2021Updated 5 years ago
- Role-Wise Data Augmentation for Knowledge Distillation☆19Nov 22, 2022Updated 3 years ago
- [AAAI 2023] IterDE: An Iterative Knowledge Distillation Framework for Knowledge Graph Embeddings☆10Apr 3, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Zero-Shot Knowledge Distillation in Deep Networks☆67Apr 16, 2022Updated 4 years ago
- Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…☆15Nov 5, 2023Updated 2 years ago
- Linux-capable out-of-order superscaler multicore LoongArch32 (LA32 / LA32R) processor.☆34Aug 9, 2024Updated last year
- Scene classification baseline. Test Acc:90.14%☆16Jul 9, 2019Updated 6 years ago
- a codebase for multi label classification with PyTorch.☆15Nov 23, 2022Updated 3 years ago
- Deep learning techniques for atherosclerotic plaque detection in the carotid artery☆16Jun 16, 2022Updated 3 years ago
- Towards Optimal Structured CNN Pruning via Generative Adversarial Learning☆18Mar 23, 2019Updated 7 years ago
- [MICCAI 2024] Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation☆13Jun 30, 2024Updated last year
- Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.☆20Oct 27, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Graph algorithms to merge two graphs based on stitching.☆12Oct 18, 2019Updated 6 years ago
- ☆14Mar 31, 2022Updated 4 years ago
- ☆19May 22, 2018Updated 8 years ago
- [ICCVW'23] Robust Asymmetric Loss for Multi-Label Long-Tailed Learning☆18Oct 3, 2023Updated 2 years ago
- Top view for the car☆23Apr 10, 2022Updated 4 years ago
- ☆31Apr 10, 2026Updated last month
- ☆10Nov 23, 2020Updated 5 years ago