Aaronhuang-778 / SliM-LLM

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
16Updated last month

Related projects: