ece-fast-lab/ISCA-2025-LIA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ece-fast-lab/ISCA-2025-LIA)

ece-fast-lab / ISCA-2025-LIA

[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading

☆25

Alternatives and similar repositories for ISCA-2025-LIA

Users that are interested in ISCA-2025-LIA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hyungyokim / LIA_AMXGPU
View on GitHub
[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading
☆13Jun 28, 2025Updated last year
onglu1 / LLM_Chat_Navigator
View on GitHub
面向长对话场景的 AI 对话阅读与跳转增强插件。在页面中提供 Prompt 侧边栏、上下快速导航和当前回复大纲，帮助你更高效地浏览、定位和回看整段对话内容。
☆18Mar 11, 2026Updated 4 months ago
Tom-CaoZH / CXL-101
View on GitHub
Contain some materials about CXL.
☆20Feb 29, 2024Updated 2 years ago
huangyibo / Awesome-CXL-Open-Source
View on GitHub
A curated list of open-source projects that help leverage CXL technology.
☆29Sep 26, 2024Updated last year
cyyself / m1-pmu-gen
View on GitHub
Generate Linux Perf event tables for Apple Silicon
☆18Dec 16, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NEO-MLSys25 / NEO
View on GitHub
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
☆99Jun 16, 2025Updated last year
Xilinx / aie-rt
View on GitHub
☆25Jun 14, 2026Updated last month
arkhadem / DX100
View on GitHub
Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper
☆19Nov 6, 2025Updated 8 months ago
Microsemi / switchtec-dma
View on GitHub
☆14Jun 9, 2026Updated last month
MoatLab / SoarAlto
View on GitHub
Tiered Memory Management Beyond Hotness (OSDI'25)
☆37Jul 31, 2025Updated 11 months ago
CASR-HKU / AGNA-FCCM2023
View on GitHub
☆12Nov 24, 2023Updated 2 years ago
a1bc2def6g / fastgl-ae
View on GitHub
☆17Jun 25, 2024Updated 2 years ago
luzhixing12345 / klinux
View on GitHub
linux 内核技术文档
☆16Apr 27, 2026Updated 2 months ago
ChaseLab-PKU / InstAttention
View on GitHub
InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference
☆18Mar 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sscargal / linux-cxl-tracker
View on GitHub
A Python tool for tracking changes in Compute Express Link (CXL) features within the Linux kernel using GitHub API. It supports various o…
☆13Jun 23, 2026Updated 3 weeks ago
riyadparvez / symdrive
View on GitHub
☆11Jun 10, 2015Updated 11 years ago
LLMServe / hydraserve
View on GitHub
☆20May 11, 2026Updated 2 months ago
ucb-bar / RoSE
View on GitHub
A unified simulation platform that combines hardware and software, enabling pre-silicon, full-stack, closed-loop evaluation of your robot…
☆47Jul 8, 2026Updated last week
grplyler / raylib-articles
View on GitHub
My Articles on Raylib related stuff.
☆13Jun 3, 2023Updated 3 years ago
hanhwi / SimPoint
View on GitHub
☆17May 9, 2022Updated 4 years ago
Yufeng98 / CENT
View on GitHub
Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025
☆141May 3, 2025Updated last year
illinois-impact / EMOGI
View on GitHub
☆26Dec 4, 2020Updated 5 years ago
HL-hanlin / GKAT
View on GitHub
☆11Apr 16, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
seanhwang10 / PCIe-CXL
View on GitHub
Note repository for studying Peripheral Component Interconnect Express (PCIe) and Compute Express Link (CXL).
☆15Feb 17, 2025Updated last year
eth-cscs / conflux
View on GitHub
Distributed Communication-Optimal LU-factorization Algorithm
☆12Aug 1, 2021Updated 4 years ago
TianheMICALab / SimCXL
View on GitHub
A full-system, cycle-level simulator based on gem5 that provides complete support for all three CXL sub-protocols and all three types of …
☆155May 11, 2026Updated 2 months ago
CSA-infra / RISCV-Scalable-Simulation-tutorial
View on GitHub
☆15Feb 2, 2026Updated 5 months ago
anuj-rai-23 / Adaptive-Replacement-Cache-ARC-Algorithm
View on GitHub
A project for Advanced Operating System(CS604) that implements ARC cache replacement policy.
☆19Aug 23, 2020Updated 5 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
View on GitHub
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48May 22, 2024Updated 2 years ago
ysarch-lab / nimble_page_management_userspace
View on GitHub
☆14Mar 29, 2019Updated 7 years ago
SpRegTiling / sparse-register-tiling
View on GitHub
☆10Mar 2, 2024Updated 2 years ago
RC4ML / CAM
View on GitHub
CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]
☆19Mar 3, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HPMLL / ZipServ_ASPLOS26
View on GitHub
☆50Dec 19, 2025Updated 7 months ago
HPCRL / ASPLOS_artifact
View on GitHub
☆13Nov 1, 2021Updated 4 years ago
Xilinx / libdfx
View on GitHub
☆13Jun 14, 2026Updated last month
GATECH-EIC / GCoD
View on GitHub
[HPCA 2022] GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆38Mar 30, 2022Updated 4 years ago
parsa-epfl / qflex
View on GitHub
Quick & Flexible Rack-Scale Computer Architecture Simulator
☆54Updated this week
NetFPGA / NetFPGA-PLUS
View on GitHub
☆59Jul 11, 2024Updated 2 years ago
aniketp / ipcp-dsp
View on GitHub
Instruction Pointer Classifier and Dynamic Degree Stream based Hardware Cache Prefetching
☆16Nov 16, 2019Updated 6 years ago