☆321Feb 17, 2026Updated last week
Alternatives and similar repositories for ruy
Users that are interested in ruy are comparing it to the libraries listed below
Sorting:
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,263Updated this week
- Low-precision matrix multiplication☆1,831Jan 29, 2024Updated 2 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,534Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆139Sep 25, 2023Updated 2 years ago
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)☆1,153Feb 18, 2026Updated last week
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,006Sep 19, 2024Updated last year
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,614Updated this week
- Conversion to/from half-precision floating point formats☆379Aug 16, 2025Updated 6 months ago
- struct2tensor is a library for parsing and manipulating structured data inside of tensorflow.☆36Feb 19, 2026Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,956Updated this week
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++☆387Jun 16, 2024Updated last year
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆938Feb 14, 2026Updated 2 weeks ago
- ☆422Jan 4, 2026Updated last month
- "Multi-Level Intermediate Representation" Compiler Infrastructure☆1,762Apr 22, 2021Updated 4 years ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,349Apr 14, 2025Updated 10 months ago
- Highly optimized inference engine for Binarized Neural Networks☆252Feb 18, 2026Updated last week
- A performant and modular runtime for TensorFlow☆753Sep 4, 2025Updated 5 months ago
- ☆16Mar 23, 2023Updated 2 years ago
- nGraph has moved to OpenVINO☆1,344Oct 15, 2020Updated 5 years ago
- ☆127Feb 17, 2026Updated last week
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆486Oct 23, 2025Updated 4 months ago
- common in-memory tensor structure☆1,169Jan 26, 2026Updated last month
- ☆20Oct 26, 2015Updated 10 years ago
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,120Updated this week
- QA dashboard for DV360 advertisers☆13Jan 20, 2021Updated 5 years ago
- JsInterop java annotations for J2CL and GWT☆23Feb 6, 2026Updated 3 weeks ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆704Updated this week
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆956Apr 11, 2025Updated 10 months ago
- Compiler for Neural Network hardware accelerators☆3,326May 11, 2024Updated last year
- Performance-portable, length-agnostic SIMD with runtime dispatch☆5,346Updated this week
- a language for fast, portable data-parallel computation☆6,577Updated this week
- Open Machine Learning Compiler Framework☆13,142Updated this week
- Cuda matrix computation library that is specified for small matrix operation (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆18Mar 8, 2024Updated last year
- ☆169Apr 14, 2022Updated 3 years ago
- ☆12Sep 1, 2022Updated 3 years ago
- ☆13May 6, 2024Updated last year
- convert the deep-residual-network(50, 101, 152) from caffe to mxnet☆11Aug 26, 2016Updated 9 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Jul 31, 2020Updated 5 years ago