ACANETS / eece-6540-labsLinks
labs and exercises for EECE.6540 Heterogeneous Computing at UMass Lowell
☆13Updated 2 years ago
Alternatives and similar repositories for eece-6540-labs
Users that are interested in eece-6540-labs are comparing it to the libraries listed below
Sorting:
- BiSUNA framework specialized to compile for the Xilinx Alveo U50☆13Updated 5 years ago
- Course Webpage for CS 217 Hardware Accelerators for Machine Learning, Stanford University☆99Updated 2 years ago
- Sandbox for TVM and playing around!☆22Updated 3 years ago
- ☆33Updated 2 years ago
- ☆23Updated 4 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆27Updated 2 years ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 6 years ago
- The code for paper: Neuralpower: Predict and deploy energy-efficient convolutional neural networks☆22Updated 6 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆16Updated 4 years ago
- ☆19Updated 6 years ago
- An external memory allocator example for PyTorch.☆16Updated 4 months ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆31Updated last year
- ☆35Updated 6 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Updated 3 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆17Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 4 years ago
- XRM (Xilinx FPGA Resource Manager) Document:☆25Updated 2 years ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Updated 4 years ago
- ☆37Updated 3 years ago
- ☆11Updated 4 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Updated 2 years ago
- ☆22Updated 10 months ago
- Benchmark PyTorch Custom Operators☆14Updated 2 years ago
- Accelerate convolution neural network for face recognition using GPU☆12Updated 5 years ago
- HW/SW co-design of sentence-level energy optimizations for latency-aware multi-task NLP inference☆53Updated last year
- study of Ampere' Sparse Matmul☆18Updated 4 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Updated 8 years ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆88Updated 2 years ago
- ☆14Updated 4 years ago