ARM-software / optimized-routines
Optimized implementations of various library functions for ARM architecture processors
☆625 Updated last week
Alternatives and similar repositories for optimized-routines
Users who are interested in optimized-routines are comparing it to the libraries listed below.
- Arm C Language Extensions (ACLE) ☆105 Updated 2 weeks ago
- The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using S… ☆457 Updated last week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT ☆726 Updated 3 weeks ago
- AutoFDO ☆560 Updated 2 weeks ago
- A benchmark for low-level CPU micro-architectural features ☆719 Updated 3 years ago
- Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual" ☆794 Updated last year
- Suite for benchmarking malloc implementations. ☆419 Updated last week
- This repository contains high-performance implementations of memset and memcpy in assembly. ☆330 Updated 3 years ago
- A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs. ☆471 Updated 2 months ago
- Application Binary Interface for the Arm® Architecture ☆1,060 Updated this week
- Open Source Architecture Code Analyzer ☆320 Updated last week
- An open optimized software library project for the ARM® Architecture ☆1,486 Updated 2 years ago
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS) ☆1,083 Updated 2 weeks ago
- Makes ARM NEON documentation accessible (with examples) ☆393 Updated last year
- Simple benchmark for memory throughput and latency ☆378 Updated last year
- Sources for Arm Streamline's gator daemon, part of Arm Mobile Studio suite of performance analysis tools ☆137 Updated last week
- A tool which profiles OpenCL devices to find their peak capacities ☆444 Updated 4 months ago
- A C library that may be linked into a C/C++ program to produce symbolic backtraces ☆1,067 Updated last month
- uops.info Code Analyzer ☆269 Updated last year
- Copy of instlatx64.atw.hu ☆217 Updated this week
- Official git repository for libdivide: optimized integer division ☆1,197 Updated this week
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more ☆137 Updated last month
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts ☆212 Updated 6 months ago
- PROPELLER: Profile Guided Optimizing Large Scale LLVM-based Relinker ☆414 Updated this week
- Device Tree Compiler ☆270 Updated 2 weeks ago
- C++ template library for high performance SIMD based sorting algorithms ☆932 Updated 2 weeks ago
- Encapsulate the frequently used AVX instructions as independent modules to reduce repeated development workload. ☆121 Updated last year
- Implementations of SIMD instruction sets for systems which don't natively support them. ☆2,672 Updated last week
- Portable (POSIX/Windows/Emscripten) thread pool for C/C++ ☆366 Updated 11 months ago
- pocl - Portable Computing Language ☆987 Updated last week