microsoft/cusync

☆27

Alternatives and similar repositories for cusync

Users interested in cusync are comparing it to the libraries listed below.

tile-ai / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆19 · Jul 13, 2026 · Updated last week
summerspringwei / souffle-ae
☆17 · Jan 24, 2024 · Updated 2 years ago
microsoft / TileFusion
TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.
☆115 · Jun 28, 2025 · Updated last year
TiledTensor / TiledLower
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13 · Nov 23, 2024 · Updated last year
robcasloz / llvm-discovery
Discovery of Structured Parallelism In Sequential and Parallel Code
☆10 · Feb 13, 2021 · Updated 5 years ago
weishengying / cutlass_flash_atten_fp8
Implements fp8 flash attention on the Ada architecture using the cutlass repository
☆82 · Aug 12, 2024 · Updated last year
KuangjuX / cu-x
🎉My Collections of CUDA Kernels~
☆11 · Jun 25, 2024 · Updated 2 years ago
MatanHamilis / one_stencil
Multiple 1-stencil implementations using nvidia cuda.
☆12 · Dec 2, 2017 · Updated 8 years ago
tanzelin430 / libsmctrl
A reproduction of the libsmctrl paper, with an added Python interface that can be called flexibly from Python to allocate compute resources
☆12 · May 21, 2024 · Updated 2 years ago
Ecybereg / HTB_Write_Ups
HTB_Write_Ups
☆27 · Feb 25, 2024 · Updated 2 years ago
VivekPanyam / cudaparsers
Parsers for CUDA binary files
☆25 · Dec 29, 2023 · Updated 2 years ago
TiledTensor / TiledCUDA
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192 · Jan 28, 2025 · Updated last year
microsoft / FractalTensor
FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …
☆32 · Dec 21, 2024 · Updated last year
cchan / tccl
extensible collectives library in triton
☆97 · Mar 31, 2025 · Updated last year
LeiWang1999 / TVM.CMakeExtend
Tutorials of Extending and importing TVM with CMAKE Include dependency.
☆16 · Oct 11, 2024 · Updated last year
tile-ai / tilelang-benchmark
☆22 · Jun 10, 2026 · Updated last month
mlc-ai / mlc-python
☆36 · Jul 19, 2025 · Updated last year
microsoft / DataCenterBridging
LLDP Fabric Info Parsing and DSC Resources used to configure Data Center Bridging - Check https://aka.ms/Validate-DCB for more informati…
☆15 · Nov 28, 2022 · Updated 3 years ago
ColfaxResearch / cfx-article-src
☆192 · May 7, 2025 · Updated last year
alibaba / redfuser
☆21 · Mar 17, 2026 · Updated 4 months ago
AlibabaResearch / mononn
☆32 · Jul 17, 2024 · Updated 2 years ago
sjfeng1999 / gpu-arch-microbenchmark
Dissecting NVIDIA GPU Architecture
☆125 · Jul 11, 2022 · Updated 4 years ago
microsoft / azure-terraform-storage-datalifecycle
Terraform Script for - Storage, container and data life cycle rules creation at scale
☆11 · Jan 10, 2023 · Updated 3 years ago
microsoft / hibernation-setup-tool
Tool to set up a Linux computer to hibernate
☆13 · Jan 28, 2026 · Updated 5 months ago
flashinfer-ai / cutlass-viz
☆65 · Apr 26, 2025 · Updated last year
PolyArch / dsagen2
Domain-Specific Architecture Generator 2
☆26 · Oct 2, 2022 · Updated 3 years ago
facebookexperimental / triton
GitHub mirror of the triton-lang/triton repo.
☆178 · Updated this week
IBM / triton-dejavu
Framework to reduce autotune overhead to zero for well known deployments.
☆101 · Sep 19, 2025 · Updated 10 months ago
toyaix / triton-runner
Multi-Level Triton Runner supporting Python, IR, PTX, AMDGCN, cubin and hasco.
☆98 · May 8, 2026 · Updated 2 months ago
nicolaswilde / amx-gemm-handwritten
Handwritten GEMM using Intel AMX (Advanced Matrix Extension)
☆17 · Jan 11, 2025 · Updated last year
illinois-impact / klap
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15 · Jun 21, 2019 · Updated 7 years ago
feifeibear / ChituAttention
Quantized Attention on GPU
☆45 · Nov 22, 2024 · Updated last year
microsoft / msr-cloak
Code for experiments referenced in the Usenix Security 2017 paper "Strong and Efficient Cache Side-Channel Protection using Hardware Tran…
☆14 · Sep 8, 2022 · Updated 3 years ago
microsoft / IIS.Common
Common source and libraries used by IIS projects
☆25 · Mar 21, 2025 · Updated last year
microsoft / Microsecond-Arduino-Latency-Clock
Microsecond Arduino Code and Schematics accompanying the paper presented in IEEE VR 2020 "Measuring System Visual Latency through Cogniti…
☆15 · Mar 26, 2020 · Updated 6 years ago
mcrl / tccl
Thunder Research Group's Collective Communication Library
☆53 · Jul 8, 2025 · Updated last year
microsoft / calipers
Criticality-aware Framework for Modeling Computer Performance
☆34 · Dec 15, 2024 · Updated last year
cakeng / ASPEN
This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Pa…
☆13 · Apr 4, 2024 · Updated 2 years ago
flashinfer-ai / cubloaty
a size profiler for cuda binary
☆71 · Jan 15, 2026 · Updated 6 months ago