google / highwayLinks
Performance-portable, length-agnostic SIMD with runtime dispatch
☆5,155Updated last week
Alternatives and similar repositories for highway
Users that are interested in highway are comparing it to the libraries listed below
Sorting:
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,869Updated 2 weeks ago
- C/C++ Performance Profiler☆4,308Updated 10 months ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE, WebAssembly, VSX, RISC-…☆2,557Updated this week
- C++ template library for high performance SIMD based sorting algorithms☆984Updated 2 months ago
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, MySQL, Chro…☆1,931Updated 2 weeks ago
- Vector class library, latest version☆1,416Updated last year
- Intel® Implicit SPMD Program Compiler☆2,794Updated this week
- A family of header-only, very fast and memory-friendly hashmap and btree containers.☆3,094Updated last week
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,588Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,307Updated last year
- mimalloc is a compact general purpose allocator with excellent performance.☆12,229Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,488Updated this week
- A hybrid thread / fiber task scheduler written in C++ 11☆1,983Updated 9 months ago
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,278Updated this week
- The compiler is available for download. Get it!☆2,541Updated 2 years ago
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception ha…☆1,879Updated 2 months ago
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,366Updated last month
- nsync is a C library that exports various synchronization primitives, such as mutexes☆1,235Updated last month
- Message passing based allocator☆1,753Updated last month
- oneAPI Threading Building Blocks (oneTBB)☆6,448Updated last week
- `std::execution`, the proposed C++ framework for asynchronous and parallel programming.☆2,123Updated this week
- Frame profiler☆14,546Updated last week
- A microbenchmark support library☆9,874Updated this week
- C++ Insights - See your source code with the eyes of a compiler☆4,419Updated 5 months ago
- A cross platform C99 library to get cpu features at runtime.☆2,563Updated 2 weeks ago
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,989Updated last year
- A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion☆1,231Updated last month
- nanobind: tiny and efficient C++/Python bindings☆3,180Updated this week
- Compile Time Regular Expression in C++☆3,714Updated 2 months ago
- SIMD Vector Classes for C++☆1,512Updated last year