jiazhihao/sosp19ae

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jiazhihao/sosp19ae)

jiazhihao / sosp19ae

Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions

☆21

Alternatives and similar repositories for sosp19ae

Users that are interested in sosp19ae are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kemingy / vllm-env
View on GitHub
setup the env for vllm users
☆16Oct 31, 2023Updated 2 years ago
petuum-inc / poseidon-release
View on GitHub
Release doc/tutorial/wheels for poseidon-tf
☆10Jan 18, 2018Updated 8 years ago
awslabs / lorien
View on GitHub
☆42Sep 8, 2023Updated 2 years ago
uuudown / SBNN
View on GitHub
Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)
☆17Dec 9, 2020Updated 5 years ago
mblo / hire-cluster-simulator
View on GitHub
Switches for HIRE: Resource Scheduling for Data Center In-Network Computing
☆13Jan 18, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NVIDIA / jax-tvm-ffi
View on GitHub
JAX support for tvm-ffi abi
☆26May 14, 2026Updated 2 months ago
chips-compilers-mlsys-21 / chips-compilers-mlsys-21.github.io
View on GitHub
☆11Apr 5, 2021Updated 5 years ago
hhy3 / pyanns
View on GitHub
🏆 The winner code for Neurips'23 BigANN Competition OOD and Sparse track.
☆15Jun 17, 2025Updated last year
czkkkkkk / gccl
View on GitHub
☆13Jan 23, 2021Updated 5 years ago
uwsampl / SparseTIR
View on GitHub
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145Mar 31, 2023Updated 3 years ago
cmu-catalyst / collage
View on GitHub
System for automated integration of deep learning backends.
☆47Aug 15, 2022Updated 3 years ago
Brown-NSG / P4Visor
View on GitHub
☆14Dec 26, 2022Updated 3 years ago
jiazhihao / attention_superoptimizer
View on GitHub
An Attention Superoptimizer
☆22Jan 20, 2025Updated last year
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yutxie / cpu-riscv
View on GitHub
ACM Class 2017 Computer Architecture
☆10Jan 11, 2018Updated 8 years ago
jiazhihao / TASO
View on GitHub
The Tensor Algebra SuperOptimizer for Deep Learning
☆743Jan 26, 2023Updated 3 years ago
ceruleangu / Block-Sparse-Benchmark
View on GitHub
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Aug 21, 2020Updated 5 years ago
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year
sysml / multistack
View on GitHub
☆17Sep 17, 2015Updated 10 years ago
wolegechu / ShuffleNetV2.Caffe2
View on GitHub
A Caffe2 implementation of ShuffleNet V2.
☆25Aug 6, 2018Updated 7 years ago
SeldonIO / mlgraph
View on GitHub
Machine Learning Inference Graph Spec
☆21Jul 27, 2019Updated 7 years ago
tlc-pack / tlcpack
View on GitHub
☆24Feb 20, 2024Updated 2 years ago
PAA-NCIC / GSWITCH
View on GitHub
A pattern-based algorithmic autotuner for graph processing on GPUs.
☆33Jun 25, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vancemiller / CUDA-preemption
View on GitHub
Experiments evaluating preemption on the NVIDIA Pascal architecture
☆16Nov 10, 2016Updated 9 years ago
cornell-zhang / allo-pldi24-artifact
View on GitHub
Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design"
☆35Apr 11, 2024Updated 2 years ago
hwang595 / PyTorch-parameter-server
View on GitHub
Implementation of Parameter Server using PyTorch communication lib
☆41Apr 7, 2019Updated 7 years ago
enkiwang / Imperceptible-fake-face-antiforensic
View on GitHub
Perception Matters: Exploring Imperceptible and Transferable Anti-forensics for GAN-generated Fake Face Imagery Detection
☆11Jan 23, 2023Updated 3 years ago
Deep-Learning-Profiling-Tools / fasten
View on GitHub
☆14Apr 24, 2024Updated 2 years ago
CentML / DeepView.Predict
View on GitHub
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆14Dec 16, 2024Updated last year
zstbackcourt / TianChiFaceAttack
View on GitHub
阿里天池AI安全挑战第一期人脸识别攻击
☆10Jun 26, 2020Updated 6 years ago
deepinsight / some-resources
View on GitHub
☆10May 14, 2023Updated 3 years ago
jeongminpark417 / GIDS
View on GitHub
☆43Jun 13, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
YukeWang96 / DSXplore_IPDPS21
View on GitHub
Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.
☆13Apr 6, 2021Updated 5 years ago
sderek / CUDAAdvisor
View on GitHub
CUDAAdvisor: a GPU profiling tool
☆53Aug 24, 2018Updated 7 years ago
huiofficial / tensorflow_ckpt_2_pb
View on GitHub
A python script that can transfer ckpt file to pb file
☆10Oct 24, 2020Updated 5 years ago
TUD-OS / migros-atc-2021
View on GitHub
Repository linking to the software artifacts used for the MigrOS ATC 2021 paper
☆18May 31, 2021Updated 5 years ago
nokia / ClickNF
View on GitHub
☆31Jul 18, 2019Updated 7 years ago
Xtra-Computing / G3
View on GitHub
G3: A Programmable GNN Training System on GPU
☆43Aug 29, 2020Updated 5 years ago
UofT-EcoSystem / rlscope
View on GitHub
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
☆48Apr 7, 2021Updated 5 years ago