rocmarchive / realcaffe2Links
The repo is obsolete. Use at your own risk.
☆12Updated 6 years ago
Alternatives and similar repositories for realcaffe2
Users that are interested in realcaffe2 are comparing it to the libraries listed below
Sorting:
- MIOpenGEMM is now deprecated☆62Updated last year
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated last year
- ROCm OpenCL Compiler Tool Driver☆24Updated 5 years ago
- hipDNN☆45Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆121Updated this week
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 8 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last month
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- ROCm Device Libraries☆97Updated last year
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 8 years ago
- The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA…☆87Updated 4 years ago
- ROCm Parallel Primitives☆172Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆242Updated last week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Python bindings for NVTX☆66Updated last year
- CNNs in Halide☆23Updated 9 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- HCC Sample Applications☆13Updated 8 years ago
- Flexible GPGPU instrumentation☆87Updated 5 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated 2 years ago
- Tools and extensions for CUDA profiling☆64Updated 5 years ago
- High Performance Linpack for GPUs (Using OpenCL, CUDA, CAL)☆91Updated 9 years ago
- Reusable software components for ROCm developers☆84Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner☆178Updated 2 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆296Updated 6 years ago
- ROCm's Thunk Interface☆91Updated 2 months ago
- This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.☆42Updated last year
- Collective Knowledge repository for NVIDIA's TensorRT☆37Updated 3 years ago