Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)
☆61Jun 27, 2025Updated 9 months ago
Alternatives and similar repositories for distillm-2
Users that are interested in distillm-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering.☆14Mar 6, 2024Updated 2 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated 11 months ago
- Official code release of Hilbert Diffusion Model (PyTorch ver.)☆21Aug 17, 2024Updated last year
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆256Mar 13, 2025Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆44Mar 31, 2026Updated last week
- Estimators for Information Theoretic Functionals using Influence Functions☆11Apr 17, 2016Updated 9 years ago
- Official PyTorch implementation of SynergyNeRF: "Synergistic Integration of Coordinate Network and Tensorial Feature for Improving NeRFs …☆12Sep 23, 2024Updated last year
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".☆16Dec 7, 2021Updated 4 years ago
- ☆31Jan 16, 2025Updated last year
- (AAAI 2021) Split-and-Bridge: Adaptable Class Incremental Learning within a Single Neural Network☆24Feb 3, 2021Updated 5 years ago
- [AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan☆14Oct 18, 2022Updated 3 years ago
- Azərbaycan dilində informatika, proqramlaşdırma və kompüter elmləri haqqında açıq və ictimai resurs platforması.☆45Mar 12, 2026Updated last month
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Apr 19, 2021Updated 4 years ago
- General Discussion☆12May 22, 2019Updated 6 years ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆62Mar 21, 2026Updated 3 weeks ago
- WildVSR☆21Dec 13, 2023Updated 2 years ago
- Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasonning techniques, chain-of-thought (COT), Large …☆27Oct 10, 2024Updated last year
- Open-source code and data for ShadowNet(S&P Oakland'23)☆12Mar 11, 2024Updated 2 years ago
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase☆11Jun 2, 2024Updated last year
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …☆17Oct 17, 2023Updated 2 years ago
- [ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models☆75Nov 23, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SprintSeoul Homepage☆15Feb 23, 2022Updated 4 years ago
- LSTM GRU with exact backpropagation derivation and implementation☆13Nov 27, 2017Updated 8 years ago
- PyTorch implementation of the article "Generative Adversarial Network for Handwritten Text"☆10Nov 13, 2023Updated 2 years ago
- The codes for ECCV'22: Learning to Train a Point Cloud Reconstruction Network without Matching☆10Nov 16, 2022Updated 3 years ago
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"☆58Dec 26, 2025Updated 3 months ago
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]☆91Sep 13, 2024Updated last year
- ☆19Jan 26, 2025Updated last year
- ☆28Feb 24, 2026Updated last month
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆14Feb 26, 2024Updated 2 years ago
- Code for AAAI'25 paper: LLM-Powered User Simulator for Recommender System☆25Jan 6, 2025Updated last year
- ☆18Nov 19, 2024Updated last year
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆43Aug 6, 2024Updated last year
- ☆15Apr 3, 2026Updated last week
- Code for "MetaFun: Meta-Learning with Iterative Functional Updates"☆14Aug 27, 2020Updated 5 years ago
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23)☆15Oct 23, 2023Updated 2 years ago