KuangjuX/Paper-reading

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KuangjuX/Paper-reading)

KuangjuX / Paper-reading

My Paper Reading Lists and Notes.

☆25

Alternatives and similar repositories for Paper-reading

Users that are interested in Paper-reading are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TiledTensor / TiledLower
View on GitHub
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13Nov 23, 2024Updated last year
nicolaswilde / amx-gemm-handwritten
View on GitHub
Handwritten GEMM using Intel AMX (Advanced Matrix Extension)
☆17Jan 11, 2025Updated last year
tanzelin430 / libsmctrl
View on GitHub
libsmctrl论文的复现，添加了python端接口，可以在python端灵活调用接口来分配计算资源
☆12May 21, 2024Updated 2 years ago
SJTU-IPADS / MetaAttention
View on GitHub
MetaAttention: A Unified and Performant Attention Framework Across Hardware Backends(PPoPP'26)
☆16Dec 31, 2025Updated 6 months ago
HuangShiqing / memory_viz_plus
View on GitHub
☆18Jun 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nox-410 / Welder
View on GitHub
OSDI 2023 Welder, deeplearning compiler
☆34Nov 24, 2023Updated 2 years ago
flashinfer-ai / debug-print
View on GitHub
Debug print operator for cudagraph debugging
☆18Aug 2, 2024Updated last year
tsinghua-ideal / Syno
View on GitHub
Source code repository for ASPLOS '25 paper "Syno: Structured Synthesis for Neural Operators"
☆15Aug 31, 2025Updated 10 months ago
NVlabs / mixedproxy
View on GitHub
☆15Nov 14, 2023Updated 2 years ago
HeliosXCore / HeliosXCore
View on GitHub
HeliosXCore is a Superscalar Out-of-order RISC-V Processor Core.
☆10Mar 8, 2024Updated 2 years ago
IBM / triton-dejavu
View on GitHub
Framework to reduce autotune overhead to zero for well known deployments.
☆102Sep 19, 2025Updated 10 months ago
TiledTensor / TiledKernel
View on GitHub
TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.
☆19May 12, 2024Updated 2 years ago
wzh99 / relay-mlir
View on GitHub
An MLIR-based toy DL compiler for TVM Relay.
☆62Oct 16, 2022Updated 3 years ago
microsoft / cusync
View on GitHub
☆27Feb 20, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
wu-kan / wuk_cupti_wrapper
View on GitHub
a simple API to use CUPTI
☆10Aug 19, 2025Updated 11 months ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
toyaix / triton-runner
View on GitHub
Multi-Level Triton Runner supporting Python, IR, PTX, AMDGCN, cubin and hasco.
☆98May 8, 2026Updated 2 months ago
cyyself / cyyrv64
View on GitHub
My RV64 CPU (Work in progress)
☆19Dec 22, 2022Updated 3 years ago
ChandlerGuan / mercury_artifact
View on GitHub
☆27Oct 1, 2025Updated 9 months ago
KuangjuX / hypocaust-2
View on GitHub
hypocaust-2, a type-1 hypervisor with H extension run on RISC-V machine
☆60Nov 30, 2023Updated 2 years ago
AlibabaResearch / mononn
View on GitHub
☆32Jul 17, 2024Updated 2 years ago
derhuerst / casket
View on GitHub
casket is an easy-to-use web file storage.
☆13Jul 31, 2021Updated 4 years ago
galois-stack / galois
View on GitHub
a tensor computing compiler based tile programming for gpu, cpu or tpu
☆45Feb 2, 2026Updated 5 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
KEKE046 / mlir-tutorial
View on GitHub
Hands-On Practical MLIR Tutorial
☆811Oct 20, 2023Updated 2 years ago
makslevental / mlir-python-extras
View on GitHub
The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.
☆118Mar 4, 2026Updated 4 months ago
illinois-impact / klap
View on GitHub
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15Jun 21, 2019Updated 7 years ago
tud-ccc / Cinnamon
View on GitHub
☆45Updated this week
gilgamsh / GenshinCPU
View on GitHub
Our repository for NSCSCC
☆21Feb 22, 2025Updated last year
TJU-NSL / awesome-papers
View on GitHub
☆37Updated this week
m0dulo / Kaleidoscope
View on GitHub
🐲 LLVM-based Kaleidoscope language compiler ✨ 基于 LLVM 的 Kaleidoscope 编译器
☆12Dec 16, 2022Updated 3 years ago
tile-ai / tvm
View on GitHub
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆20Updated this week
l1nkr / DL-Compiler-Navigation
View on GitHub
Machine Learning Compiler Road Map
☆45Sep 12, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yzlnew / infra-skills
View on GitHub
A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-perfo…
☆139Jul 9, 2026Updated 2 weeks ago
muriloboratto / NVSHEMEM
View on GitHub
Sample Codes using NVSHMEM on Multi-GPU
☆30Jan 22, 2023Updated 3 years ago
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
eth-cscs / Tiled-MM
View on GitHub
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆33Apr 2, 2025Updated last year
yuanxinnn / APTMoE
View on GitHub
☆13Jun 29, 2024Updated 2 years ago
pku-liang / MAGIS
View on GitHub
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆57May 29, 2024Updated 2 years ago
linkedlist771 / UCAS-MOOC-AutoWatch
View on GitHub
☆28Jan 24, 2024Updated 2 years ago