☆321Feb 17, 2026Updated last month
Alternatives and similar repositories for ruy
Users that are interested in ruy are comparing it to the libraries listed below
Sorting:
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,276Updated this week
- Low-precision matrix multiplication☆1,832Jan 29, 2024Updated 2 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,543Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Sep 25, 2023Updated 2 years ago
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)☆1,157Mar 12, 2026Updated last week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,661Updated this week
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆945Feb 14, 2026Updated last month
- oneAPI Deep Neural Network Library (oneDNN)☆3,964Updated this week
- ☆169Apr 14, 2022Updated 3 years ago
- Conversion to/from half-precision floating point formats☆380Aug 16, 2025Updated 7 months ago
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++☆388Jun 16, 2024Updated last year
- struct2tensor is a library for parsing and manipulating structured data inside of tensorflow.☆36Feb 19, 2026Updated last month
- ☆423Feb 24, 2026Updated 3 weeks ago
- A performant and modular runtime for TensorFlow☆753Sep 4, 2025Updated 6 months ago
- Highly optimized inference engine for Binarized Neural Networks☆252Updated this week
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆956Apr 11, 2025Updated 11 months ago
- npcomp - An aspirational MLIR based numpy compiler☆51Jul 31, 2020Updated 5 years ago
- XLA integration of Open Neural Network Exchange (ONNX)☆19Aug 17, 2018Updated 7 years ago
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S…☆488Oct 23, 2025Updated 4 months ago
- nGraph has moved to OpenVINO☆1,341Oct 15, 2020Updated 5 years ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆569Sep 15, 2025Updated 6 months ago
- ☆72Mar 26, 2025Updated 11 months ago
- Open Machine Learning Compiler Framework☆13,197Updated this week
- ☆16Mar 23, 2023Updated 2 years ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,350Apr 14, 2025Updated 11 months ago
- int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991☆73Dec 30, 2023Updated 2 years ago
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆3,122Updated this week
- Clspv is a compiler for OpenCL C to Vulkan compute shaders☆705Mar 5, 2026Updated 2 weeks ago
- HiFi 5 NN Library☆51Oct 1, 2025Updated 5 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,549Aug 28, 2019Updated 6 years ago
- common in-memory tensor structure☆1,177Jan 26, 2026Updated last month
- Performance-portable, length-agnostic SIMD with runtime dispatch☆5,395Updated this week
- Compiler for Neural Network hardware accelerators☆3,326May 11, 2024Updated last year
- Arm NN ML Software.☆1,299Jan 23, 2026Updated last month
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,566Mar 14, 2026Updated last week
- ☆20Oct 26, 2015Updated 10 years ago
- heterogeneity-aware-lowering-and-optimization☆257Jan 20, 2024Updated 2 years ago
- a language for fast, portable data-parallel computation☆6,601Updated this week
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆48Mar 12, 2026Updated last week