Twilight92z / Quantize-WatermarkLinks

☆19

Alternatives and similar repositories for Quantize-Watermark

Users that are interested in Quantize-Watermark are comparing it to the libraries listed below

Sorting:

decoding-comp-trust / comp-trust
Codebase for decoding compressed trust.
☆24Updated last year
SempraETY / Pruning-via-Merging
☆20Updated 9 months ago
hahnyuan / ASVD4LLM
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆78Updated 11 months ago
SafeAILab / RAIN
[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning
☆97Updated last year
avalonstrel / Mitigating-the-Alignment-Tax-of-RLHF
☆15Updated last year
zyxxmu / DSnoT
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆49Updated last year
MrGGLS / BlockPruner
A block pruning framework for LLMs.
☆25Updated 4 months ago
sail-sg / LightTrans
The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"
☆20Updated 5 months ago
AI45Lab / REEF
The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…
☆63Updated 8 months ago
UCSB-NLP-Chang / ThinkPrune
☆43Updated 5 months ago
sail-sg / I-FSJ
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)
☆64Updated 8 months ago
DYR1 / MoGU
Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.
☆16Updated 8 months ago
ldery / Bonsai
Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"
☆28Updated last year
yaojin17 / Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆60Updated 11 months ago
htqin / IR-QLoRA
[ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti…
☆67Updated last year
VITA-Group / Junk_DNA_Hypothesis
[ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…
☆16Updated 5 months ago
yxli2123 / LoSparse
☆59Updated last year
raymin0223 / fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
☆62Updated 11 months ago
GeniusHTX / TALE
☆130Updated last week
harveyhuang18 / EMR_Merging
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆69Updated 6 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆179Updated last year
Lucky-Lance / Expert_Sparsity
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
☆101Updated last year
machilusZ / FastGen
This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
☆39Updated last year
mrflogs / LoRA-Pro
Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "
☆131Updated 5 months ago
xiatingyu / SFT-DataSelection-at-scale
☆30Updated 7 months ago
hemingkx / TokenSkip
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆179Updated 2 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆85Updated 7 months ago
zjysteven / mink-plus-plus
[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆44Updated 3 months ago
VITA-Group / SEAL
Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free
☆40Updated 5 months ago
Zanette-Labs / SpeculativeRejection
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
☆51Updated 10 months ago