SNU-ARC / OpenDNNLinks
OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library
☆25Updated 5 years ago
Alternatives and similar repositories for OpenDNN
Users that are interested in OpenDNN are comparing it to the libraries listed below
Sorting:
- ☆109Updated last year
- TVM for Tenstorrent ASICs☆27Updated 2 months ago
- ☆119Updated last week
- IREE plugin repository for the AMD AIE accelerator☆113Updated last week
- ☆47Updated 4 years ago
- a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆35Updated 4 years ago
- ☆159Updated this week
- Dissecting NVIDIA GPU Architecture☆112Updated 3 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆234Updated 3 years ago
- Conversions to MLIR EmitC☆134Updated 11 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆145Updated last week
- Buda Compiler Backend for Tenstorrent devices☆30Updated 8 months ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆84Updated last month
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Updated 2 years ago
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆61Updated 8 months ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated 2 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆102Updated this week
- ☆15Updated 3 weeks ago
- An extension library of WMMA API (Tensor Core API)☆109Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆137Updated last week
- GPTPU for SC 2021☆52Updated 2 years ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆33Updated 2 months ago
- A tool for examining GPU scheduling behavior.☆89Updated last year
- A home for the final text of all TVM RFCs.☆108Updated last year
- Performance Prediction Toolkit for GPUs☆39Updated 3 years ago
- MLIR-based partitioning system☆151Updated this week
- TPP experimentation on MLIR for linear algebra☆139Updated this week
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆137Updated 11 months ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆117Updated 3 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆180Updated 3 years ago