SNU-ARC / OpenDNNLinks
OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library
☆25Updated 6 years ago
Alternatives and similar repositories for OpenDNN
Users that are interested in OpenDNN are comparing it to the libraries listed below
Sorting:
- TVM for Tenstorrent ASICs☆28Updated 3 months ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆85Updated 2 months ago
- ☆161Updated last week
- ☆110Updated last year
- a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆36Updated 4 years ago
- Dissecting NVIDIA GPU Architecture☆115Updated 3 years ago
- A tool for examining GPU scheduling behavior.☆89Updated last year
- ☆47Updated 5 years ago
- IREE plugin repository for the AMD AIE accelerator☆117Updated last week
- ☆121Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆146Updated last week
- Modified version of PyTorch able to work with changes to GPGPU-Sim☆57Updated 3 years ago
- Buda Compiler Backend for Tenstorrent devices☆30Updated 8 months ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆66Updated 2 months ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆120Updated 3 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆103Updated last week
- Performance Prediction Toolkit for GPUs☆39Updated 3 years ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated 2 years ago
- ☆27Updated 6 years ago
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆65Updated last year
- MLIR-based partitioning system☆153Updated last week
- ☆46Updated 6 months ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/…☆22Updated 2 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Updated 2 years ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆124Updated last month
- Tenstorrent Kernel Module☆57Updated last week
- ☆50Updated 6 years ago
- GPTPU for SC 2021☆52Updated 2 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆236Updated 3 years ago
- A home for the final text of all TVM RFCs.☆108Updated last year