ChengZhang-98 / llm-mixed-qLinks

Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"
22Updated last year

Alternatives and similar repositories for llm-mixed-q

Users that are interested in llm-mixed-q are comparing it to the libraries listed below

Sorting: