mrcat2018 / AutodiffEngine
☆13 · Updated 5 years ago

Alternatives and similar repositories for AutodiffEngine:
Users interested in AutodiffEngine are comparing it to the libraries listed below.
- InsNet runs instance-dependent neural networks with padding-free dynamic batching. ☆66 · Updated 3 years ago
- Place for meetup slides ☆140 · Updated 4 years ago
- Inference framework for MoE layers based on TensorRT with Python bindings ☆41 · Updated 3 years ago
- A fast multi-processing BERT inference system ☆101 · Updated 2 years ago
- Write an automatic differentiation tool in 200 lines ☆50 · Updated 5 years ago
- Play GEMM with TVM ☆89 · Updated last year
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆125 · Updated 7 years ago
- Reading the PyTorch source code, version 0.2.0 ☆90 · Updated 5 years ago
- How to design a CPU GEMM on x86 with AVX-256 that can beat OpenBLAS ☆68 · Updated 5 years ago
- ☆23 · Updated last year
- A demo of how to write a high-performance convolution that runs on Apple silicon ☆54 · Updated 3 years ago
- A home for the final text of all TVM RFCs. ☆103 · Updated 5 months ago
- ☆22 · Updated 5 years ago
- ☆70 · Updated 2 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections ☆119 · Updated 2 years ago
- flexible-gemm conv of deepcore ☆17 · Updated 5 years ago
- ☆194 · Updated last year
- ☆95 · Updated 3 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios. ☆35 · Updated 3 weeks ago
- TensorFlow and TVM integration ☆37 · Updated 4 years ago
- A hands-on tutorial on TVM's core principles ☆60 · Updated 4 years ago
- Materials related to the Triton compiler ☆28 · Updated 2 months ago
- Notes on reading the TensorFlow source code ☆13 · Updated 6 years ago
- ☆145 · Updated 2 months ago
- Standalone Flash Attention v2 kernel without libtorch dependency ☆106 · Updated 6 months ago
- A quantitative performance comparison among DL compilers on CNN models ☆74 · Updated 4 years ago
- A simple deep learning framework that supports automatic differentiation and GPU acceleration. ☆58 · Updated last year
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer ☆89 · Updated 3 weeks ago
- ☆82 · Updated last year
- Subpart of the source code of deepcore v0.7 ☆27 · Updated 4 years ago
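Several of the repositories above (AutodiffEngine itself, and the "write an automatic differentiation tool in 200 lines" entry) build small reverse-mode autodiff engines. As a rough illustration of the technique these projects implement, here is a minimal scalar reverse-mode sketch; the `Var` class and its methods are hypothetical names for this example, not the API of any repository listed.

```python
class Var:
    """A scalar that records its computation graph for reverse-mode autodiff.

    Illustrative sketch only; not the API of AutodiffEngine or any repo above.
    """

    def __init__(self, value, parents=()):
        self.value = value
        self.grad = 0.0
        self._parents = parents  # pairs of (parent Var, local gradient)

    def __add__(self, other):
        # d(a+b)/da = 1, d(a+b)/db = 1
        return Var(self.value + other.value,
                   parents=((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        # d(a*b)/da = b, d(a*b)/db = a
        return Var(self.value * other.value,
                   parents=((self, other.value), (other, self.value)))

    def backward(self):
        # Topologically order the graph, then apply the chain rule in reverse.
        order, seen = [], set()

        def visit(v):
            if v not in seen:
                seen.add(v)
                for parent, _ in v._parents:
                    visit(parent)
                order.append(v)

        visit(self)
        self.grad = 1.0
        for v in reversed(order):
            for parent, local in v._parents:
                parent.grad += local * v.grad

x = Var(3.0)
y = Var(4.0)
z = x * y + x  # dz/dx = y + 1 = 5, dz/dy = x = 3
z.backward()
print(x.grad, y.grad)  # → 5.0 3.0
```

Operator overloading via `__add__`/`__mul__` is what lets ordinary-looking arithmetic build the graph implicitly; a full engine like those above adds more operators, tensors, and GPU kernels on top of the same idea.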