ACANETS / eece-6540-labs
labs and exercises for EECE.6540 Heterogeneous Computing at UMass Lowell
☆13Updated 2 years ago
Alternatives and similar repositories for eece-6540-labs
Users that are interested in eece-6540-labs are comparing it to the libraries listed below
- Yet another Polyhedra Compiler for DeepLearning☆19Updated 2 years ago
- The code for the paper: NeuralPower: Predict and deploy energy-efficient convolutional neural networks☆21Updated 5 years ago
- An external memory allocator example for PyTorch.☆14Updated 3 years ago
- Learn NVDLA by SOMNIA☆33Updated 5 years ago
- ☆19Updated 9 months ago
- This is a repo which contains some details about how to use the OpenCL backend (Xilinx/Intel).☆25Updated 5 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.☆17Updated 11 months ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆18Updated 4 years ago
- Sandbox for TVM and playing around!☆22Updated 2 years ago
- ☆30Updated 2 years ago
- ☆11Updated 4 years ago
- This is a demo of how to write a high-performance convolution that runs on Apple silicon☆54Updated 3 years ago
- ☆23Updated 3 years ago
- Benchmark PyTorch Custom Operators☆14Updated last year
- This is the implementation for the paper: AdaTune: Adaptive Tensor Program Compilation Made Efficient (NeurIPS 2020).☆14Updated 4 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆17Updated 2 years ago
- Study of Ampere's Sparse Matmul☆18Updated 4 years ago
- ☆29Updated 4 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆26Updated 6 months ago
- ☆19Updated 5 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- OpenDLA for trying the demo and FPGA solution☆16Updated 2 years ago
- ☆14Updated 3 years ago
- Accelerate convolutional neural networks for face recognition using a GPU☆12Updated 4 years ago
- A Toy-Purpose TPU Simulator☆19Updated last year
- TVM learning and research☆13Updated 4 years ago
- Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University☆97Updated 2 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Updated 7 years ago
- CUDA project for uni subject☆23Updated 4 years ago