This PyTorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).
☆46 | Oct 17, 2022 | Updated 3 years ago
Alternatives and similar repositories for PLATON
Users who are interested in PLATON are comparing it to the libraries listed below.
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024" ☆13 | Feb 14, 2025 | Updated last year
- ☆11 | Jan 19, 2025 | Updated last year
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022) ☆29 | Feb 9, 2022 | Updated 4 years ago
- ☆235 | Jun 11, 2024 | Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models" ☆18 | Oct 1, 2024 | Updated last year
- Synthesizing realistic and diverse text-datasets from augmented LLMs ☆16 | Jan 26, 2026 | Updated last month
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers ☆192 | Feb 28, 2023 | Updated 3 years ago
- Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML 2023) ☆40 | Aug 28, 2023 | Updated 2 years ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression." ☆18 | Dec 13, 2024 | Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization ☆24 | Oct 10, 2025 | Updated 4 months ago
- This repo is reproduction resources for linear alignment paper, still working ☆18 | May 19, 2024 | Updated last year
- ☆20 | Nov 4, 2025 | Updated 4 months ago
- ☆21 | May 24, 2024 | Updated last year
- DDRel: A new dataset for interpersonal relation classification in dyadic dialogues ☆23 | Sep 12, 2021 | Updated 4 years ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024) ☆55 | Jan 13, 2025 | Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks ☆50 | Sep 4, 2025 | Updated 6 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆62 | Feb 21, 2025 | Updated last year
- ☆63 | Oct 17, 2023 | Updated 2 years ago
- [ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen… ☆28 | Aug 29, 2023 | Updated 2 years ago
- Source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models" ☆34 | Jun 20, 2024 | Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs ☆260 | Dec 16, 2024 | Updated last year
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023). ☆369 | Jun 1, 2023 | Updated 2 years ago
- Recommender for suggesting letter writers 👍 ☆34 | Aug 12, 2024 | Updated last year
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》 ☆25 | Dec 15, 2021 | Updated 4 years ago
- [ICCV 2025] Dynamic-VLM ☆28 | Dec 16, 2024 | Updated last year
- ☆36 | Feb 26, 2024 | Updated 2 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. ☆11 | Aug 30, 2024 | Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆31 | Apr 8, 2024 | Updated last year
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models ☆130 | May 22, 2025 | Updated 9 months ago
- A collection of paper implementations using the PyTorch framework ☆29 | Jun 11, 2021 | Updated 4 years ago
- [NeurIPS 2023] Understanding and Improving Feature Learning for Out-of-Distribution Generalization ☆29 | May 27, 2025 | Updated 9 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆40 | Jun 10, 2024 | Updated last year
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks ☆12 | Feb 21, 2020 | Updated 6 years ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" ☆78 | Nov 25, 2024 | Updated last year
- Repository of IPBench ☆19 | Jan 4, 2026 | Updated 2 months ago
- ☆11 | Dec 23, 2024 | Updated last year
- On the Effectiveness of Parameter-Efficient Fine-Tuning ☆38 | Nov 4, 2023 | Updated 2 years ago
- Concurrency library ☆17 | Oct 13, 2024 | Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models". ☆12 | Oct 14, 2024 | Updated last year