LOG-postech / rethinking-LLM-pruning
☆27Updated 2 months ago
Alternatives and similar repositories for rethinking-LLM-pruning:
Users who are interested in rethinking-LLM-pruning are comparing it to the repositories listed below:
- Official Pytorch code for "SASSHA: Sharpness-aware Adaptive Second-order Optimization With Stable Hessian Approximation"☆12Updated last month
- Code for reproducing the results from arXiv paper "Critical Influence of Overparameterization on Sharpness-aware Minimization"☆17Updated 10 months ago
- ☆13Updated last month
- 🔨 Malet (Machine Learning Experiment Tool) is a tool for efficient machine learning experiment execution, logging, analysis, and plot ma…☆17Updated 3 months ago
- ☆24Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆101Updated last year
- ☆14Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆80Updated 7 months ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Updated 6 months ago
- ☆18Updated 5 months ago
- Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆14Updated last year
- ☆12Updated 6 months ago
- Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…☆47Updated last year
- ☆38Updated last year
- Explanation techniques for atomic factual relations in generated Korean documents☆16Updated 4 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆43Updated 6 months ago
- ☆13Updated 11 months ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆20Updated last year
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆19Updated 2 months ago
- ☆66Updated 3 years ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆32Updated 5 months ago
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆22Updated 8 months ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)☆19Updated 7 months ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆17Updated 11 months ago
- Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks☆36Updated 2 months ago
- ☆27Updated last month
- ☆10Updated 2 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆44Updated 6 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆57Updated 6 months ago
- Official implementation for LaCo (EMNLP 2024 Findings)☆16Updated 6 months ago