Pytorch Implementation of "Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models", AAAI 2025
☆38Feb 4, 2026Updated 2 months ago
Alternatives and similar repositories for Multi-Level-OT
Users that are interested in Multi-Level-OT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Mar 13, 2024Updated 2 years ago
- Official code for 'TraM: Enhancing User Sleep Prediction with Transformer-Based Multivariate Time Series Modeling and Machine Learning En…☆19Jul 3, 2024Updated last year
- ☆22Jul 9, 2025Updated 9 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆258Mar 13, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fine-tune GPT2 to generate fake job experiences☆12Jan 17, 2023Updated 3 years ago
- ☆11Feb 3, 2025Updated last year
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆47Feb 13, 2025Updated last year
- ☆48Feb 1, 2022Updated 4 years ago
- Implementation of DeepMind's "Sobolev Training for Neural Networks"☆11Apr 2, 2018Updated 8 years ago
- ☆37Aug 18, 2025Updated 8 months ago
- A small demo for training cnn with pytorch.☆11Dec 15, 2018Updated 7 years ago
- [ACL 2026 (Main)] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆82Jul 14, 2025Updated 9 months ago
- ☆13Jun 24, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This package can be used to read image data using BioFormats into numpy arrays. It was initially created to read CZI image files, but sho…☆12May 21, 2021Updated 4 years ago
- Uses C-GAN for feature hallucination of missing modalities for hyperspectral data. TensorFlow implementation of ICCV '19 paper☆11Sep 9, 2020Updated 5 years ago
- ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence☆45Aug 8, 2025Updated 8 months ago
- Learn how to create and deploy an ESP high availability system using Kafka as the message broker.☆10Feb 20, 2020Updated 6 years ago
- ResNet-50 for TsinghuaDog classification☆10Feb 2, 2021Updated 5 years ago
- ☆19Sep 24, 2022Updated 3 years ago
- Official implementation: Population Aware Diffusion for Time Series Generation (AAAI-25)☆16Sep 1, 2025Updated 8 months ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated last year
- Role-Wise Data Augmentation for Knowledge Distillation☆19Nov 22, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Zero-Shot Knowledge Distillation in Deep Networks☆67Apr 16, 2022Updated 4 years ago
- [TMLR'26] UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models☆54Mar 24, 2026Updated last month
- Scene classification baseline. Test Acc:90.14%☆16Jul 9, 2019Updated 6 years ago
- a codebase for multi label classification with PyTorch.☆15Nov 23, 2022Updated 3 years ago
- This repo holds the code for: {Transformer-based Spatio-temporal Analysis for Automatic Classification of Aortic Stenosis Severity from B…☆13Nov 29, 2022Updated 3 years ago
- Deep learning techniques for atherosclerotic plaque detection in the carotid artery☆16Jun 16, 2022Updated 3 years ago
- A virtual machine implementation of "伟福" COP2000 development board (microinstruction level)☆17Dec 22, 2022Updated 3 years ago
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated 3 months ago
- Towards Optimal Structured CNN Pruning via Generative Adversarial Learning☆18Mar 23, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code and data for the GPT-4 based benchmark in the vicuna blog post☆43Aug 2, 2023Updated 2 years ago
- Boilerplate Django REST and Angular app with jwt login☆14Jan 7, 2023Updated 3 years ago
- [MICCAI 2024] Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation☆13Jun 30, 2024Updated last year
- Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.☆20Oct 27, 2021Updated 4 years ago
- GitHub repository for KDD 2021 work: ProtoPShare: Prototypical Parts Sharing for Similarity Discovery in Interpretable Image Classificati…☆14May 30, 2021Updated 4 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…☆82Dec 30, 2021Updated 4 years ago
- ☆14Mar 31, 2022Updated 4 years ago