ece-fast-lab / ISCA-2025-LIA

[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading
21 · Updated 3 weeks ago

Alternatives and similar repositories for ISCA-2025-LIA

Users who are interested in ISCA-2025-LIA are comparing it to the libraries listed below.
