hyungyokim/LIA_AMXGPU

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hyungyokim/LIA_AMXGPU)

hyungyokim / LIA_AMXGPU

[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading

☆13

Alternatives and similar repositories for LIA_AMXGPU

Users that are interested in LIA_AMXGPU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ece-fast-lab / ISCA-2025-LIA
View on GitHub
[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading
☆25Jan 6, 2026Updated 6 months ago
sfu-arch / SpGEMM
View on GitHub
☆35Apr 20, 2021Updated 5 years ago
SZU-AdvTech-2022 / 376-HyGCN-A-GCN-Accelerator-with-Hybrid-Architecture
View on GitHub
☆12Mar 14, 2023Updated 3 years ago
Xilinx / aie-rt
View on GitHub
☆25Jun 14, 2026Updated last month
VITA-Group / Q-Hitter
View on GitHub
☆15Jun 4, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Azure / Moneo
View on GitHub
Distributed AI/HPC Monitoring Framework
☆29Apr 11, 2025Updated last year
tsinghua-ideal / spada-sim
View on GitHub
The simulator for SPADA, an SpGEMM accelerator with adaptive dataflow
☆47Jan 26, 2023Updated 3 years ago
esa-tu-darmstadt / spn-compiler
View on GitHub
Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.
☆25Nov 29, 2024Updated last year
GATECH-EIC / GCoD
View on GitHub
[HPCA 2022] GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆38Mar 30, 2022Updated 4 years ago
sharc-lab / GenGNN
View on GitHub
☆37Jan 20, 2022Updated 4 years ago
kateinoigakukun / llvm-next-function-merging
View on GitHub
An experimental LLVM pass plugin that allows you to apply the State of the Art function merging techniques
☆16Feb 8, 2025Updated last year
sfu-arch / SPAGHETTI
View on GitHub
RTL generator for SpGEMM
☆12Feb 2, 2021Updated 5 years ago
NeuraChip / neurachip
View on GitHub
NeuraChip Accelerator Simulator
☆16Apr 26, 2024Updated 2 years ago
promoe-opensource / promoe
View on GitHub
☆20Jan 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ysarch-lab / nimble_page_management_userspace
View on GitHub
☆14Mar 29, 2019Updated 7 years ago
Xilinx / libdfx
View on GitHub
☆13Jun 14, 2026Updated last month
thu-nics / CLAP-triangle-counting
View on GitHub
[DATE'23] The official code for paper <CLAP: Locality Aware and Parallel Triangle Counting with Content Addressable Memory>
☆24May 25, 2026Updated 2 months ago
keijiro / DxrSketch230728
View on GitHub
Small experimental Unity project: DXR + Spline
☆14Sep 13, 2023Updated 2 years ago
einverne / AndroidFaceDetectDemo
View on GitHub
Android 人脸检测 android.media, play service, Face++
☆11Aug 13, 2016Updated 9 years ago
umd-memsys / gem5
View on GitHub
This is an read-only mirror of the gem5 simulator. The upstream repository is stored in https://gem5.googlesource.com, code reviews shoul…
☆13May 15, 2020Updated 6 years ago
sibyl-dev / Explingo
View on GitHub
Explaining ML models using LLMs
☆25Oct 21, 2024Updated last year
ukonpower / glsl-graphics
View on GitHub
⚠⚠ Integrated to https://github.com/ukonpower/ore-gl ⚠⚠
☆11Jun 29, 2023Updated 3 years ago
msakuta / rusty-behavior-tree-lite
View on GitHub
Lightweight behavior tree implementation in Rust
☆11Jan 4, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TrelisResearch / llama-2-setup
View on GitHub
Prompt format and padding guide for Llama 2
☆12Sep 18, 2023Updated 2 years ago
dorpxam / einops-cpp
View on GitHub
C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation
☆12Oct 16, 2023Updated 2 years ago
jocstech / AndroidCameraSudokuSolver
View on GitHub
An OpenCV Android Camera Sudoku Solver
☆11Apr 7, 2017Updated 9 years ago
glacierx / rproxy
View on GitHub
A blazing fast, cross-platform TCP & UDP proxy with automatic DNS re-resolution and a built-in terminal UI config editor. Zero-downtime h…
☆13Apr 21, 2026Updated 3 months ago
jwvg0425 / Integer
View on GitHub
C++ BigInteger
☆16Feb 25, 2015Updated 11 years ago
AIS-SNU / GraNNDis_Artifact
View on GitHub
[PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…
☆10Aug 13, 2024Updated last year
KeenS / gtk-examples
View on GitHub
☆12Jul 23, 2023Updated 3 years ago
UDC-GAC / polybench-python
View on GitHub
PolyBench/Python is the reimplementation of PolyBench in the Python programming language. It is a benchmark suite of 30 numerical computa…
☆10Feb 23, 2021Updated 5 years ago
zhenlin36 / scatter_gather_aes_cuda
View on GitHub
A High-Performance Side-Channel-Resistant AES on GPUs
☆13May 9, 2019Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
IaroslavElistratov / triton-autodiff
View on GitHub
☆19Nov 11, 2025Updated 8 months ago
gem5-hpca-2024 / gem5
View on GitHub
☆10Mar 3, 2024Updated 2 years ago
hongkunyoo / how-to-scale-your-ml-job-with-k8s
View on GitHub
Open Infrastructure & Cloud Native Days Korea 2019 - Workshop
☆18Mar 25, 2021Updated 5 years ago
tqfang / comet-deepspeed
View on GitHub
Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.
☆14Jan 23, 2022Updated 4 years ago
ipuneetrathore / BERT_models
View on GitHub
This repository contains fine tuned BERT models
☆12Jul 17, 2020Updated 6 years ago
Amit-P89 / -DRackSim
View on GitHub
Pin based tool for simulation of rack-scale disaggregated memory systems
☆33Mar 8, 2025Updated last year
phi16 / book-of-space
View on GitHub
「空間表現の絵本」
☆13Jan 15, 2022Updated 4 years ago