RUCKBReasoning / LLM-Streamline

Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"
19Updated last week

Alternatives and similar repositories for LLM-Streamline:

Users that are interested in LLM-Streamline are comparing it to the libraries listed below