KhronosGroup / NNEF-Registry
Neural Network Exchange Format registry
☆12Updated last year
Alternatives and similar repositories for NNEF-Registry:
Users that are interested in NNEF-Registry are comparing it to the libraries listed below
- Scoreboard for ONNX Backend Compatibility☆28Updated this week
- An ONNX backend using PlaidML☆28Updated 6 years ago
- CNNs in Halide☆23Updated 9 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- Python Binding to NVRTC☆79Updated 5 months ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- CuPy Benchmark☆12Updated 5 years ago
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 6 years ago
- ☆14Updated 6 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020.☆22Updated 4 years ago
- MXNet - nGraph integration☆34Updated 3 years ago
- ☆26Updated 2 years ago
- nGraph™ Backend for ONNX☆42Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- Repository for ONNX working group artifacts☆24Updated 2 months ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- Codebase associated with the PyTorch compiler tutorial☆46Updated 5 years ago
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 4 years ago
- ROCm OpenCL Compiler Tool Driver☆24Updated 5 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆40Updated 6 years ago
- This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.☆42Updated last year
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- ☆10Updated 2 years ago
- ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files.☆18Updated 6 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 10 months ago
- Distributed machine learning platform☆12Updated 9 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- NNVM for ROCm Examples☆19Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- tutorial to optimize GEMM performance on android☆51Updated 9 years ago