Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".
☆29Feb 10, 2025Updated last year
Alternatives and similar repositories for DiLM
Users that are interested in DiLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆23Jan 29, 2026Updated last month
- ☆20Feb 24, 2025Updated last year
- ☆33Aug 28, 2024Updated last year
- Soft-Label Dataset Distillation and Text Dataset Distillation☆74Nov 17, 2022Updated 3 years ago
- Prioritize Alignment in Dataset Distillation☆21Dec 3, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for our ICML'24 on multimodal dataset distillation☆43Oct 11, 2024Updated last year
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆106May 23, 2024Updated last year
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆19Nov 17, 2025Updated 4 months ago
- LAMP: Extracting Text from Gradients with Language Model Priors (NeurIPS '22)☆29May 26, 2025Updated 10 months ago
- AAAI 2024, M3D: Dataset Condensation by Minimizing Maximum Mean Discrepancy☆25Mar 2, 2024Updated 2 years ago
- Official Implementation of "The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thou…☆14Jul 2, 2025Updated 8 months ago
- Dataset Condensation (ICLR21 and ICML21)☆543Nov 27, 2023Updated 2 years ago
- ☆10Apr 29, 2023Updated 2 years ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Aug 5, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Oct 9, 2024Updated last year
- Enhancing contextual understanding in large language models through contrastive decoding☆20May 3, 2024Updated last year
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.☆31Dec 21, 2025Updated 3 months ago
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.☆39Jun 6, 2024Updated last year
- Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models.☆15Mar 18, 2024Updated 2 years ago
- (TCSVT 2022) Context-Aware Mixup for Domain Adaptive Semantic Segmentation☆17Jan 20, 2023Updated 3 years ago
- Application and blog explaining my interpretations of In-run Data Shapley☆30Jan 30, 2025Updated last year
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Mar 10, 2026Updated 2 weeks ago
- Official PyTorch implementation of “Flexible Dataset Distillation: Learn Labels Instead of Images”☆41Oct 21, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Low-rank Highway Networks☆13Mar 11, 2016Updated 10 years ago
- ☆13Mar 25, 2022Updated 4 years ago
- A pytorch implementation of CVPR24 paper "D4M: Dataset Distillation via Disentangled Diffusion Model"☆40Sep 6, 2024Updated last year
- Code for the pubblication "Distilled Replay: Overcoming Forgetting through Synthetic Examples"☆12Apr 1, 2021Updated 4 years ago
- ☆50Apr 1, 2023Updated 2 years ago
- ☆64Dec 30, 2024Updated last year
- Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"☆12Jun 25, 2024Updated last year
- ☆13Jul 20, 2023Updated 2 years ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Dec 14, 2025Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PrivacyAsst: Safeguarding User Privacy in Tool-Using Large Language Model Agents (TDSC 2024)☆19Mar 29, 2024Updated last year
- Official codebase for paper Disentangled Condensation for Graphs (DisCo). This codebase is based on the open-source Pytorch Geometric fra…☆11Feb 12, 2025Updated last year
- Planning for Success: Exploring LLM Long-term Planning Capabilities in Table Understanding☆17Jun 17, 2025Updated 9 months ago
- Official Repository for Heterogeneous Models Dataset Condensation (ECCV 2024, Oral)☆10Dec 15, 2024Updated last year
- 2-3 Click Run, and enjoy it☆13Jun 16, 2023Updated 2 years ago
- Repository that contains the code for the paper titled, 'Unifying Distillation with Personalization in Federated Learning'.☆13May 31, 2021Updated 4 years ago
- The Third Place Winner in Generative Track of the ECCV 2024 DD Challenge☆10Oct 11, 2024Updated last year