LOG-postech / rethinking-LLM-pruning
β23Updated 3 months ago
Alternatives and similar repositories for rethinking-LLM-pruning:
Users that are interested in rethinking-LLM-pruning are comparing it to the libraries listed below
- π¨ Malet (Machine Learning Experiment Tool) is a tool for efficient machine learning experiment execution, logging, analysis, and plot maβ¦β17Updated this week
- Code for reproducing the results from arXiv paper "Critical Influence of Overparameterization on Sharpness-aware Minimization"β14Updated 6 months ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)β14Updated last year
- [ICLR 2022] Towards Continual Knowledge Learning of Language Modelsβ92Updated 2 years ago
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Modelsβ79Updated 4 months ago
- β20Updated last year
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Modelsβ69Updated 8 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)β56Updated 3 months ago
- β23Updated 11 months ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Spβ¦β20Updated 10 months ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"β11Updated last year
- β15Updated 10 months ago
- β23Updated last year
- KAIST AI605 Deep Learning for NLPβ31Updated 2 years ago
- β10Updated 4 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".β92Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]β17Updated 8 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasksβ¦β37Updated last month
- β63Updated 2 years ago
- β83Updated 9 months ago
- β36Updated last year
- π€« Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Conβ¦β36Updated last year
- β16Updated last month
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.β24Updated 8 months ago
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"β18Updated 2 years ago
- [NeurIPS 2022 Workshop] A Case Study with Negated Prompts using T0 (3B, 11B), InstructGPT (350M-175B), GPT-3 (350M - 175B) & OPT (125M - β¦β24Updated 2 years ago
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023β21Updated 11 months ago
- demo page of krafton virtual Sherlockβ7Updated last year
- Dataset and code for paper: "Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese".β16Updated last month
- νκ΅μ΄ μμ± λ¬Έμμ μμ μ¬μ€ κ΄κ³μ λν μ€λͺ κΈ°μβ14Updated last month