(Deprecated) hipCaffe: the HIP port of Caffe
☆124May 1, 2024Updated last year
Alternatives and similar repositories for hipCaffe
Users who are interested in hipCaffe are comparing it to the libraries listed below
- ☆14Mar 21, 2019Updated 6 years ago
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆438Jun 10, 2020Updated 5 years ago
- benchmarking miopen☆17Jan 14, 2019Updated 7 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆255Feb 10, 2026Updated 2 weeks ago
- Fast binary matrix product on CPU☆10Feb 11, 2016Updated 10 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆29Dec 30, 2019Updated 6 years ago
- MIOpenGEMM is now deprecated☆61Jul 17, 2023Updated 2 years ago
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- ROCm OpenCL Compiler Tool Driver☆24Nov 22, 2019Updated 6 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆390Updated this week
- The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA…☆85Jun 16, 2020Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆199Feb 20, 2026Updated last week
- OpenCL support for TensorFlow via SYCL☆65Jul 9, 2018Updated 7 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Jun 16, 2017Updated 8 years ago
- OpenCL support for TensorFlow☆476Oct 26, 2017Updated 8 years ago
- Dockerfiles for the various software layers defined in the ROCm software platform☆510Jan 27, 2026Updated last month
- Fork of the Blaze library for compatibility with Blaze CUDA · https://bitbucket.org/blaze-lib/blaze · https://github.com/STEllAR-GROUP/bl…☆10Oct 17, 2019Updated 6 years ago
- ☆38Mar 1, 2017Updated 9 years ago
- This is an experimental version of OpenCL by AMD Research. We now recommend that you use the official BVLC Caffe OpenCL branch, which is over at …☆530Aug 31, 2018Updated 7 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆55Jan 22, 2026Updated last month
- Compute applications.☆25Dec 12, 2019Updated 6 years ago
- HSAIL Instruction Set Simulator - for testing execution of HSAIL Brig files☆36Sep 18, 2014Updated 11 years ago
- Documents and source code related to a Hybrid HPL run for IU's BR2 machine☆16Nov 27, 2012Updated 13 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆411Updated this week
- This repository contains multiple implementations of Flash Attention optimized with Triton kernels, showcasing progressive performance im…☆11Jan 19, 2026Updated last month
- Dashboard style widgets for displaying elasticsearch database results and statistics.☆11Aug 21, 2015Updated 10 years ago
- NTRUEncrypt: im in ur quantum box, maybe☆13Apr 18, 2019Updated 6 years ago
- HIP: C++ Heterogeneous-Compute Interface for Portability☆4,332Feb 10, 2026Updated 2 weeks ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆31Sep 19, 2024Updated last year
- Caffe re-implementation of ShuffleNet☆106Apr 1, 2018Updated 7 years ago
- ☆12May 3, 2020Updated 5 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Jul 7, 2017Updated 8 years ago
- ☆34Mar 25, 2017Updated 8 years ago
- OpenCL 1.2 implementation for Tensorflow☆797Oct 15, 2022Updated 3 years ago
- TensorFlow ROCm port☆699Updated this week
- ☆16Nov 19, 2025Updated 3 months ago
- Tutorials to GPU programming. Reading notes.☆19Apr 27, 2023Updated 2 years ago
- Benchmarks of Deep Neural Networks☆39May 19, 2021Updated 4 years ago
- A matconvnet implementation of the Single Shot Detector☆36Jan 23, 2019Updated 7 years ago