ACANETS / eece-6540-labs

Labs and exercises for EECE.6540 Heterogeneous Computing at UMass Lowell

☆13

Alternatives and similar repositories for eece-6540-labs

Users who are interested in eece-6540-labs are comparing it to the repositories listed below.

ModelTC / pyvlova
Yet another Polyhedra Compiler for DeepLearning
☆19 Updated 2 years ago
enyac-group / NeuralPower
The code for the paper "NeuralPower: Predict and deploy energy-efficient convolutional neural networks"
☆21 Updated 5 years ago
zhuzilin / pytorch-malloc
An external memory allocator example for PyTorch.
☆14 Updated 3 years ago
soDLA-publishment / somnia
Learn NVDLA by SOMNIA
☆33 Updated 5 years ago
LeiWang1999 / Stream-k.tvm
☆19 Updated 9 months ago
ybai62868 / OpenCL_Xilinx-Intel_HeteroCL
This repo contains some details about how to use the OpenCL backend (Xilinx/Intel).
☆25 Updated 5 years ago
nycu-caslab / TinyTS
This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.
☆17 Updated 11 months ago
tlc-pack / TLCBench
Benchmark scripts for TVM
☆74 Updated 3 years ago
jack-willturner / nas-as-program-transformation-exploration
The code for our paper "Neural Architecture Search as Program Transformation Exploration"
☆18 Updated 4 years ago
AndrewZhaoLuo / TVM-Sandbox
Sandbox for TVM and playing around!
☆22 Updated 2 years ago
WuDan0399 / Integrate-NVDLA-and-TVM
☆30 Updated 2 years ago
chips-compilers-mlsys-21 / chips-compilers-mlsys-21.github.io
☆11 Updated 4 years ago
pigirons / conv3x3_m1
This is a demo of how to write a high-performance convolution that runs on Apple silicon
☆54 Updated 3 years ago
jgoeders / dac_sdc_2021
☆23 Updated 3 years ago
comaniac / epoi
Benchmark PyTorch Custom Operators
☆14 Updated last year
limenghao / AdaTune
This is the implementation for the paper: AdaTune: Adaptive Tensor Program Compilation Made Efficient (NeurIPS 2020).
☆14 Updated 4 years ago
CAS-CLab / BlockConv
[TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA
☆17 Updated 2 years ago
lenLRX / AmpereSparseMatmul
Study of Ampere's Sparse Matmul
☆18 Updated 4 years ago
areusch / microtvm-blogpost-eval
☆29 Updated 4 years ago
microsoft / FractalTensor
FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …
☆26 Updated 6 months ago
microideax / T-DLA
☆19 Updated 5 years ago
anony-sub / chameleon
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
☆27 Updated 5 years ago
SCLUO / ITRI-OpenDLA
OpenDLA for trying the demo and FPGA solution
☆16 Updated 2 years ago
union-codesign / union
☆14 Updated 3 years ago
WalkerLau / GPU-CNN
Accelerate convolutional neural networks for face recognition using a GPU
☆12 Updated 4 years ago
jinhachung / tptpu-sim
A Toy-Purpose TPU Simulator
☆19 Updated last year
CharlieCurry / tvm-learning
TVM learning and research
☆13 Updated 4 years ago
cs217 / cs217.github.io
Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University
☆97 Updated 2 years ago
quettabit / convolution_kernel
Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.
☆14 Updated 7 years ago
piojanu / CUDA-im2col-conv
CUDA project for uni subject
☆23 Updated 4 years ago