JIA-Lab-research / Q-LLMView on GitHub
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
55Jul 16, 2024Updated last year

Alternatives and similar repositories for Q-LLM

Users that are interested in Q-LLM are comparing it to the libraries listed below

Sorting:

Are these results useful?