tfruan2000/mlsys-study-note

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tfruan2000/mlsys-study-note)

tfruan2000 / mlsys-study-note

My study note for mlsys

☆14

Alternatives and similar repositories for mlsys-study-note

Users that are interested in mlsys-study-note are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

summerspringwei / souffle-ae
View on GitHub
☆17Jan 24, 2024Updated 2 years ago
microsoft / triton-shared
View on GitHub
Shared Middle-Layer for Triton Compilation
☆340Dec 5, 2025Updated 7 months ago
Cambricon / triton-linalg
View on GitHub
Development repository for the Triton-Linalg conversion
☆221Feb 7, 2025Updated last year
serdes21 / flashtile
View on GitHub
FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.
☆61Feb 6, 2026Updated 5 months ago
joker-eph / llvm-project-with-mlir
View on GitHub
Clone of the LLVM project with MLIR repo integrated as a top-level subproject
☆12Dec 11, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Mogball / triton_lite
View on GitHub
☆20May 24, 2025Updated last year
zartbot / gfd
View on GitHub
GPU Functional Descriptor for memory access
☆34May 24, 2026Updated last month
buddy-compiler / buddy-mlir
View on GitHub
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
☆742Updated this week
FHE-Applications / FHE-Applications
View on GitHub
FHE (CKKS, TFHE) end-to-end applications: HELR (logistic regression), ResNet-20, LSTM (RNN), bitonic sorting, DeepCNN-x
☆18Aug 14, 2024Updated last year
violetDelia / LLCompiler
View on GitHub
☆25Jun 11, 2025Updated last year
DiscreteTom / dt-blog-boilerplate
View on GitHub
DiscreteTom's Blog Boilerplate.
☆10Mar 6, 2023Updated 3 years ago
AlibabaResearch / mononn
View on GitHub
☆32Jul 17, 2024Updated 2 years ago
wehu / c-mlir
View on GitHub
A translator from c to MLIR
☆33Nov 15, 2021Updated 4 years ago
whutbd / cuda-learn-note
View on GitHub
🎉CUDA 笔记 / 高频面试题汇总 / C++笔记，个人笔记，更新随缘: sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.
☆48Jan 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
misson20000 / twili-libnx
View on GitHub
Twili I/O library for libnx
☆14Jan 13, 2020Updated 6 years ago
caiwanxianhust / FasterLLaMA
View on GitHub
使用 CUDA C++ 实现的 llama 模型推理框架
☆64Nov 8, 2024Updated last year
wu-kan / wuk_cupti_wrapper
View on GitHub
a simple API to use CUPTI
☆10Aug 19, 2025Updated 11 months ago
csirlin / OpenTGPTPU
View on GitHub
Fork of github.com/UCSBarchlab/OpenTPU for the TGPTPU project
☆15Jun 1, 2025Updated last year
gangliao / MapReduceFramework
View on GitHub
Map Reduce infrastructure lite using c++11 and gRPC
☆22Dec 4, 2016Updated 9 years ago
makslevental / nelli
View on GitHub
A lightweight, Pythonic, frontend for MLIR
☆80Oct 21, 2023Updated 2 years ago
xiaotianxia / vue-163news-dev
View on GitHub
a vue-demo：vue仿网易新闻m站
☆10Jul 26, 2017Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kaist-cp / shakeflow
View on GitHub
ShakeFlow: Functional Hardware Description with Latency-Insensitive Interface Combinators (ASPLOS 2023)
☆58Jan 23, 2025Updated last year
toyaix / triton-ocl
View on GitHub
Triton for OpenCL backend, and use mlir-translate to get source OpenCL code
☆27Aug 27, 2025Updated 10 months ago
arc-research-lab / Aries
View on GitHub
ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)
☆63Mar 8, 2026Updated 4 months ago
intel / intel-xpu-backend-for-triton
View on GitHub
OpenAI Triton backend for Intel® GPUs
☆258Updated this week
spcl / pymlir
View on GitHub
Python interface for MLIR - the Multi-Level Intermediate Representation
☆271Nov 28, 2024Updated last year
KnorrFG / qsp
View on GitHub
A simple S-Expression parser for rust TokenStreams
☆16Nov 23, 2025Updated 7 months ago
human-analysis / AutoFHE
View on GitHub
Official implementation for AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE. The paper is presented at the 33rd USE…
☆34Nov 24, 2025Updated 7 months ago
triton-lang / Triton-to-tile-IR
View on GitHub
incubator repo for CUDA-TileIR backend
☆148Jul 10, 2026Updated last week
kitbarton / LLVMLoopOptTutorial
View on GitHub
Tutorial for LLVM Dev Conference 2019.
☆15Oct 23, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
open-lm-engine / accelerated-model-architectures
View on GitHub
A bunch of kernels that might make stuff slower 😉
☆91Updated this week
janestreet / ocaml-compiler-libs
View on GitHub
compiler libraries repackaged
☆21Jan 4, 2024Updated 2 years ago
kimbochen / mini-llama-mlx
View on GitHub
A simple LLaMA implementation using MLX.
☆15Apr 22, 2024Updated 2 years ago
gpu-mode / ring-attention
View on GitHub
ring-attention experiments
☆171Oct 17, 2024Updated last year
meta-pytorch / spmd_types
View on GitHub
This module defines a type system for distributed training code, based off of JAX's sharding in types, but adapted for the PyTorch ecosys…
☆34Updated this week
gty111 / PTX-EMU
View on GitHub
PTX-EMU is a simple emulator for CUDA program.
☆40Apr 25, 2025Updated last year
billmuch / matmul_perf_test
View on GitHub
☆15Apr 15, 2022Updated 4 years ago