Vector Math Library
☆85Nov 7, 2025Updated 4 months ago
Alternatives and similar repositories for OpenVML
Users who are interested in OpenVML are comparing it to the libraries listed below.
- GPU Automatically Tuned Linear Algebra Software☆28Sep 1, 2015Updated 10 years ago
- flexible-gemm conv of deepcore☆17Dec 2, 2019Updated 6 years ago
- A tester for BLAS libraries, including OpenBLAS and Intel MKL. This project is based on the ATLAS BLAS Tester.☆37Feb 2, 2023Updated 3 years ago
- profiling gemm on android☆10Apr 1, 2016Updated 9 years ago
- Build-to-Order BLAS☆12Apr 9, 2019Updated 6 years ago
- OpenMPL (Open Math Performance Library) is a collection of open source math libraries, including BLAS, LAPACK, FFT, VML, and others.☆22Aug 15, 2023Updated 2 years ago
- A managed platform and language for GPGPU☆32Dec 3, 2012Updated 13 years ago
- Experimental Linear Algebra Performance Studies☆12Feb 24, 2017Updated 9 years ago
- BLAS OpenCL implementation.☆16Apr 8, 2015Updated 10 years ago
- GCN ISA assembler tool for my GSoC project at Openwall☆35Jan 4, 2016Updated 10 years ago
- Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration☆37Jan 8, 2026Updated 2 months ago
- ☆36Mar 17, 2015Updated 11 years ago
- Support for ternary logic in SSE, XOP, AVX2 and x86 programs☆31Jan 5, 2025Updated last year
- Convert a Caffe Model to a Theano Model☆11Mar 30, 2015Updated 10 years ago
- ☆29Apr 18, 2024Updated last year
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆7,339Updated this week
- Subsumed into xnd☆25Aug 30, 2023Updated 2 years ago
- Simple, efficient and flexible vision toolbox for mxnet framework.☆31Nov 28, 2017Updated 8 years ago
- High-performance object-based library for DLA computations☆50Updated this week
- libForBES is a C++ solver for generic, constrained and possibly nonsmooth convex optimization problems. LASSO, optimal control, elastic n…☆10Apr 11, 2017Updated 8 years ago
- Sample code for classifying images into two categories using Caffe features + SVM.☆10Dec 21, 2014Updated 11 years ago
- Nonblocking data structures☆12Jan 25, 2015Updated 11 years ago
- The fundamental package for scientific computing with Python.☆22Dec 23, 2023Updated 2 years ago
- tiny dnn android, dependency-free deep learning framework in C++11 running on Android☆15Jan 15, 2017Updated 9 years ago
- ☆14Mar 27, 2016Updated 9 years ago
- A Limited-Memory Quasi-Newton Algorithm for Bound-Constrained Nonsmooth Optimization☆13Dec 23, 2016Updated 9 years ago
- This project implements minimal functionality for real-time 3D cardiac electrophysiology simulation☆16Oct 24, 2016Updated 9 years ago
- Heterogeneous Run Time version of MXNet. Added heterogeneous capabilities to the MXNet, uses heterogeneous computing infrastructure frame…☆72Feb 11, 2018Updated 8 years ago
- Heterogeneous Run Time version of Caffe. Added heterogeneous capabilities to the Caffe, uses heterogeneous computing infrastructure frame…☆269Oct 16, 2018Updated 7 years ago
- Some deep learning models written with mxnet and C++11.☆12Feb 6, 2018Updated 8 years ago
- Caffe for Sparse and Low-rank Deep Neural Networks☆382Mar 8, 2020Updated 6 years ago
- An official implementation for MS-DETR in ACL'23☆17Jun 3, 2023Updated 2 years ago
- A fixed version of Robert G. Brown's "dieharder" tests for random number generators.☆13Apr 1, 2021Updated 4 years ago
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆60Apr 14, 2024Updated last year
- Torch implementation of CVPR'17 - Local Binary Convolutional Neural Networks http://xujuefei.com/lbcnn.html☆104Nov 1, 2018Updated 7 years ago
- A MXNet implementation of Xception☆20Sep 26, 2017Updated 8 years ago
- Conversion to/from half-precision floating point formats☆380Aug 16, 2025Updated 7 months ago
- A Convolutional Neural Network Cascade for Face Detection☆14May 29, 2016Updated 9 years ago
- Testing R memory usage with different malloc implementations - glibc malloc, jemalloc, tcmalloc☆15Nov 22, 2018Updated 7 years ago