ACANETS / eece-6540-labs
Labs and exercises for EECE.6540 Heterogeneous Computing at UMass Lowell
☆13 Updated 2 years ago
Alternatives and similar repositories for eece-6540-labs
Users who are interested in eece-6540-labs are comparing it to the repositories listed below.
- Yet another Polyhedra Compiler for DeepLearning ☆19 Updated 2 years ago
- An external memory allocator example for PyTorch. ☆14 Updated 3 years ago
- Benchmark scripts for TVM ☆74 Updated 3 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration" ☆18 Updated 4 years ago
- A demo of how to write a high-performance convolution that runs on Apple silicon ☆54 Updated 3 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future. ☆17 Updated last year
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of … ☆28 Updated 6 months ago
- ☆69 Updated 2 years ago
- Framework to reduce autotune overhead to zero for well known deployments. ☆79 Updated this week
- ☆11 Updated 4 years ago
- ☆19 Updated 9 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 Updated last year
- Benchmark PyTorch Custom Operators ☆14 Updated 2 years ago
- Benchmark tests supporting the TiledCUDA library. ☆16 Updated 7 months ago
- GPTQ inference TVM kernel ☆40 Updated last year
- Triton adapter for Ascend. Mirror of https://gitee.com/ascend/triton-ascend ☆59 Updated this week
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 Updated 5 years ago
- BiSUNA framework specialized to compile for the Xilinx Alveo U50 ☆12 Updated 4 years ago
- Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University ☆98 Updated 2 years ago
- ☆24 Updated 2 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆18 Updated 2 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA ☆17 Updated 3 years ago
- Sandbox for TVM and playing around! ☆22 Updated 2 years ago
- Study of Ampere's Sparse Matmul ☆18 Updated 4 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi… ☆26 Updated 2 years ago
- ☆20 Updated 2 years ago
- CSV spreadsheets and other material for AI accelerator survey papers ☆172 Updated last year
- ☆39 Updated 5 years ago
- ☆31 Updated 2 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks ☆21 Updated 6 years ago