Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)
☆65 · Jun 27, 2025 · Updated 10 months ago
Alternatives and similar repositories for distillm-2
Users that are interested in distillm-2 are comparing it to the libraries listed below.
- This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering. ☆14 · Mar 6, 2024 · Updated 2 years ago
- Official code release of Hilbert Diffusion Model (PyTorch ver.) ☆21 · Aug 17, 2024 · Updated last year
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆259 · Mar 13, 2025 · Updated last year
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023) ☆21 · Oct 28, 2023 · Updated 2 years ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro… ☆11 · Jul 9, 2025 · Updated 9 months ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features ☆10 · Oct 5, 2022 · Updated 3 years ago
- ☆12 · Mar 17, 2024 · Updated 2 years ago
- ☆13 · Dec 13, 2024 · Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ☆19 · Jul 16, 2024 · Updated last year
- ☆21 · Jul 3, 2025 · Updated 10 months ago
- This repository is the official implementation of "Partial Channel Network: Compute Fewer, Perform Better". [AAAI 2026 Accepted] ☆37 · Feb 11, 2025 · Updated last year
- An open, public resource platform on informatics, programming, and computer science in the Azerbaijani language. ☆45 · Mar 12, 2026 · Updated last month
- (AAAI 2021) Split-and-Bridge: Adaptable Class Incremental Learning within a Single Neural Network ☆24 · Feb 3, 2021 · Updated 5 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition ☆13 · Oct 22, 2024 · Updated last year
- ☆11 · Apr 19, 2021 · Updated 5 years ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same… ☆63 · Mar 21, 2026 · Updated last month
- A simple implementation of reverse mode automatic differentiation in C++ without the use of any libraries. ☆12 · Jul 3, 2018 · Updated 7 years ago
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase ☆11 · Jun 2, 2024 · Updated last year
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP … ☆17 · Oct 17, 2023 · Updated 2 years ago
- [ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models ☆75 · Nov 23, 2024 · Updated last year
- SprintSeoul Homepage ☆15 · Feb 23, 2022 · Updated 4 years ago
- LSTM GRU with exact backpropagation derivation and implementation ☆13 · Nov 27, 2017 · Updated 8 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir… ☆66 · Oct 25, 2024 · Updated last year
- Pytorch implementation of our paper MaxQ: Multi-Axis Query for N:M Sparsity Network accepted by CVPR 2024. ☆37 · Mar 12, 2024 · Updated 2 years ago
- ☆22 · Oct 22, 2024 · Updated last year
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop] ☆91 · Sep 13, 2024 · Updated last year
- ☆29 · Feb 24, 2026 · Updated 2 months ago
- Some microbenchmarks and design docs before commencement ☆11 · Feb 1, 2021 · Updated 5 years ago
- ☆18 · Oct 22, 2024 · Updated last year
- ☆13 · Sep 25, 2023 · Updated 2 years ago
- Implementation of SayCan, organized as a python project. ☆14 · Sep 7, 2023 · Updated 2 years ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight) ☆18 · Dec 5, 2024 · Updated last year
- ☆40 · Jan 23, 2024 · Updated 2 years ago
- ICML2025: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework ☆15 · Jun 24, 2025 · Updated 10 months ago
- ☆62 · Jun 23, 2025 · Updated 10 months ago
- ☆18 · Nov 19, 2024 · Updated last year
- Quickly hashing all subexpressions of a program modulo alpha-renaming ☆17 · Sep 7, 2021 · Updated 4 years ago
- ☆15 · Apr 3, 2026 · Updated last month
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23) ☆15 · Oct 23, 2023 · Updated 2 years ago