yinuotxie / Efficient-LLM-Inferencing-on-GPUs

Penn CIS 5650 (GPU Programming and Architecture) Final Project
28 · Updated last year

Alternatives and similar repositories for Efficient-LLM-Inferencing-on-GPUs:

Users who are interested in Efficient-LLM-Inferencing-on-GPUs are comparing it to the libraries listed below.