KhronosGroup / NNEF-Registry
Neural Network Exchange Format registry
☆12 · Updated last year
Related projects:
- An ONNX backend using PlaidML ☆28 · Updated 6 years ago
- Codebase associated with the PyTorch compiler tutorial ☆44 · Updated 5 years ago
- nGraph™ Backend for ONNX ☆42 · Updated last year
- Scoreboard for ONNX Backend Compatibility ☆24 · Updated this week
- A portable high-level API with CUDA or OpenCL back-end ☆53 · Updated 6 years ago
- npcomp - An aspirational MLIR based numpy compiler ☆50 · Updated 4 years ago
- ONNX Parser is a tool that automatically generates OpenVX inference code (CNN) from ONNX binary model files. ☆17 · Updated 5 years ago
- CNNs in Halide ☆22 · Updated 8 years ago
- NNVM for ROCm Examples ☆19 · Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 · Updated 7 years ago
- Build TVM docker image for production compilation deployments ☆13 · Updated 3 years ago
- XLA integration of Open Neural Network Exchange (ONNX) ☆19 · Updated 6 years ago
- Automatically insert nvtx ranges to PyTorch models ☆17 · Updated 3 years ago
- Python Binding to NVRTC ☆79 · Updated 6 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance ☆17 · Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v0.6 benchmark. ☆42 · Updated last year
- Accelerating DNN Convolutional Layers with Micro-batches ☆64 · Updated 4 years ago
- Example code used in the CVPR 2015 tutorial ☆38 · Updated 8 years ago
- Notes and artifacts from the ONNX steering committee ☆24 · Updated this week
- portDNN is a library implementing neural network algorithms written using SYCL ☆106 · Updated 3 months ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 · Updated 7 years ago
- Test Winograd convolution written in TVM for CUDA and AMDGPU ☆39 · Updated 5 years ago
- Cooperative Primitives for CUDA C++ Kernel Authors. This repository contains CUB PRs from Q4 2019 until Q4 2020. ☆22 · Updated 3 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark. ☆55 · Updated last year
- Tools and extensions for CUDA profiling ☆63 · Updated 4 years ago
- MXNet - nGraph integration ☆34 · Updated 2 years ago
- Library for fast image convolution in neural networks on Intel Architecture ☆29 · Updated 7 years ago
- AI-related samples made available by the DevTech ProViz team ☆27 · Updated 5 months ago