LOG-postech / rethinking-LLM-pruning
★21 Updated last month
Related projects
Alternatives and complementary repositories for rethinking-LLM-pruning
- 🔨 Malet (Machine Learning Experiment Tool) is a tool for efficient machine learning experiment execution, logging, analysis, and plot ma… ★16 Updated 2 months ago
- Code for reproducing the results from arXiv paper "Critical Influence of Overparameterization on Sharpness-aware Minimization" ★13 Updated 4 months ago
- Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23) ★12 Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models ★76 Updated 2 months ago
- ★22 Updated 9 months ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models ★66 Updated 6 months ago
- ★20 Updated last year
- KAIST AI605 Deep Learning for NLP ★31 Updated 2 years ago
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models ★93 Updated 2 years ago
- Explanations of factual relationships in Korean generated text ★13 Updated last month
- ★15 Updated 8 months ago
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023 ★19 Updated 9 months ago
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con… ★34 Updated 11 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ★87 Updated last year
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" ★47 Updated last month
- ★36 Updated last year
- ★13 Updated last year
- AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain ★22 Updated 2 years ago
- ★10 Updated 9 months ago
- ★63 Updated 2 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA ★16 Updated 2 years ago
- ★58 Updated last year
- CharFormer (Tay et al., 2022; Gradient-based Subword Tokenizer + T5) model implementation for Huggingface Transformers ★21 Updated last month
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022). ★59 Updated 2 years ago
- ★15 Updated 2 years ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)" ★11 Updated last year
- ★48 Updated last year
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis ★10 Updated last week
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp… ★19 Updated 8 months ago
- [EMNLP 2023 Findings] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt ★20 Updated last year