ACANETS / eece-6540-labsLinks
labs and exercises for EECE.6540 Heterogeneous Computing at UMass Lowell
☆13Updated 2 years ago
Alternatives and similar repositories for eece-6540-labs
Users that are interested in eece-6540-labs are comparing it to the libraries listed below
Sorting:
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆23Updated 6 years ago
- ☆11Updated 4 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆27Updated 2 years ago
- Yet another Polyhedra Compiler for DeepLearning☆19Updated 2 years ago
- An external memory allocator example for PyTorch.☆16Updated 5 months ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 6 years ago
- ☆33Updated 2 years ago
- Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University☆100Updated 3 weeks ago
- TQT's pytorch implementation.☆21Updated 4 years ago
- This is a demo how to write a high performance convolution run on apple silicon☆57Updated 3 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆17Updated 3 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆30Updated 3 years ago
- BiSUNA framework specialized to compile for the Xilinx Alveo U50☆13Updated 5 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆51Updated 2 years ago
- ☆23Updated 4 years ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Updated 8 years ago
- HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference☆54Updated last year
- Learn NVDLA by SOMNIA☆42Updated 6 years ago
- ☆24Updated 3 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.☆19Updated 5 months ago
- ☆170Updated 2 years ago
- study of Ampere' Sparse Matmul☆18Updated 5 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆29Updated 4 years ago
- Sandbox for TVM and playing around!☆22Updated 3 years ago
- A collection of research papers on efficient training of DNNs☆70Updated 3 years ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆88Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated 2 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆31Updated last year