arumaekawa/DiLM

arumaekawa / DiLM

Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL 2024 Findings).

☆28

Alternatives and similar repositories for DiLM

Users who are interested in DiLM are comparing it to the repositories listed below.

princetonvisualai / What-is-Dataset-Distillation-Learning
☆18 · Jun 14, 2024 · Updated 2 years ago
sunnytqin / no-distillation
☆22 · Feb 24, 2025 · Updated last year
ilia10000 / dataset-distillation
Soft-Label Dataset Distillation and Text Dataset Distillation
☆74 · Nov 17, 2022 · Updated 3 years ago
Jiacheng8 / CV-DD
Dataset Distillation via Committee Voting
☆15 · Jul 28, 2025 · Updated 11 months ago
silicx / LoRS_Distill
Code for our ICML'24 paper on multimodal dataset distillation
☆44 · Oct 11, 2024 · Updated last year
LINs-lab / ReLA
[NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations
☆19 · Jan 19, 2025 · Updated last year
NUS-HPC-AI-Lab / DATM
ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching
☆108 · May 23, 2024 · Updated 2 years ago
Guang000 / Awesome-Dataset-Distillation
A curated list of awesome papers on dataset distillation and related applications.
☆1,964 · Updated this week
ahmedcs / REFL
Resource Efficient Federated Learning
☆25 · Jan 13, 2023 · Updated 3 years ago
Hansong-Zhang / M3D
AAAI 2024, M3D: Dataset Condensation by Minimizing Maximum Mean Discrepancy
☆26 · Mar 2, 2024 · Updated 2 years ago
VICO-UoE / DatasetCondensation
Dataset Condensation (ICLR21 and ICML21)
☆542 · Nov 27, 2023 · Updated 2 years ago
AsafShul / PoDD
Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.
☆39 · Jun 6, 2024 · Updated 2 years ago
yuz1wan / video_distillation
Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.
☆32 · Dec 21, 2025 · Updated 7 months ago
shaoshitong / G_VBSM_Dataset_Condensation
[CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)
☆27 · Oct 9, 2024 · Updated last year
RQLuo / MixTeX-DataHub
LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotation…
☆12 · Aug 13, 2024 · Updated last year
dbash / pix2pix_cyclegan_guess_noise
☆11 · Jan 21, 2021 · Updated 5 years ago
THU-KEG / SafetyNeuron
Data and code for the paper: Finding Safety Neurons in Large Language Models
☆30 · Jan 29, 2026 · Updated 5 months ago
aryopg / decore
Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"
☆30 · Dec 18, 2024 · Updated last year
qianyuzqy / CAMix
(TCSVT 2022) Context-Aware Mixup for Domain Adaptive Semantic Segmentation
☆17 · Jan 20, 2023 · Updated 3 years ago
ondrejbohdal / label-distillation
Official PyTorch implementation of “Flexible Dataset Distillation: Learn Labels Instead of Images”
☆41 · Oct 21, 2020 · Updated 5 years ago
vimar-gu / MinimaxDiffusion
[CVPR2024] Efficient Dataset Distillation via Minimax Diffusion
☆103 · Mar 22, 2024 · Updated 2 years ago
LINs-lab / RCGM
[ICLR 2026] Any-step Generation via N-th Order Recursive Consistent Velocity Field Estimation
☆39 · Feb 4, 2026 · Updated 5 months ago
arumaekawa / text-dataset-distillation
☆13 · Mar 25, 2022 · Updated 4 years ago
amazon-science / ContextualUnderstanding-ContrastiveDecoding
Enhancing contextual understanding in large language models through contrastive decoding
☆19 · May 3, 2024 · Updated 2 years ago
zhukun1020 / NoiseFilter_IB
☆19 · Sep 3, 2024 · Updated last year
princetonvisualai / multimodal_dataset_distillation
☆65 · Dec 30, 2024 · Updated last year
sarapieri / fed_het
☆12 · Oct 28, 2023 · Updated 2 years ago
SamuelGong / grad_attacks
Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models.
☆14 · Mar 18, 2024 · Updated 2 years ago
weixuan-wang123 / SADI
☆19 · Sep 1, 2025 · Updated 10 months ago
pipilurj / DynaFed
☆50 · Apr 1, 2023 · Updated 3 years ago
XavierZhang2002 / ICR_Probe
Code repository of "ICR Probe: Tracking Hidden State Dynamics for Reliable Hallucination Detection in LLMs" (ACL 2025).
☆18 · Mar 22, 2026 · Updated 4 months ago
LiuYangArt / PSBanana
☆20 · Dec 6, 2025 · Updated 7 months ago
andrearosasco / DistilledReplay
Code for the publication "Distilled Replay: Overcoming Forgetting through Synthetic Examples"
☆12 · Apr 1, 2021 · Updated 5 years ago
KU-VGI / HMDC
Official Repository for Heterogeneous Models Dataset Condensation (ECCV 2024, Oral)
☆10 · Dec 15, 2024 · Updated last year
yaolu-zjut / DDInterpreter
☆15 · May 28, 2024 · Updated 2 years ago
Iyashinouta / chilloutmix-auto1111-colab
2-3 Click Run, and enjoy it
☆13 · Jun 16, 2023 · Updated 3 years ago
smallporridge / WebUltron
The source code of the paper "WebUltron: An Ultimate Retriever on Webpages under the Model-centric Paradigm"
☆13 · Mar 21, 2023 · Updated 3 years ago
pengbohua / AngularGap
☆13 · Jul 20, 2023 · Updated 3 years ago
UmeanNever / RankSurprisalRatio
[ACL 2026 Main] Official Repo for Paper "Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Ali…
☆17 · Jul 1, 2026 · Updated 3 weeks ago