google / highwayLinks
Performance-portable, length-agnostic SIMD with runtime dispatch
☆5,317Updated last week
Alternatives and similar repositories for highway
Users that are interested in highway are comparing it to the libraries listed below
Sorting:
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,934Updated last week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE, WebAssembly, VSX, RISC-…☆2,614Updated this week
- C/C++ Performance Profiler☆4,315Updated last year
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, MySQL, Chro…☆1,989Updated this week
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,670Updated this week
- C++ template library for high performance SIMD based sorting algorithms☆997Updated last week
- mimalloc is a compact general purpose allocator with excellent performance.☆12,439Updated this week
- Intel® Implicit SPMD Program Compiler☆2,839Updated this week
- C++ Insights - See your source code with the eyes of a compiler☆4,451Updated 7 months ago
- The compiler is available for download. Get it!☆2,545Updated 2 years ago
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,292Updated last week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,310Updated 2 years ago
- A hybrid thread / fiber task scheduler written in C++ 11☆1,992Updated 11 months ago
- Vector class library, latest version☆1,431Updated 2 years ago
- A microbenchmark support library☆9,995Updated this week
- A memory allocator that automatically reduces the memory footprint of C/C++ applications.☆1,845Updated this week
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception ha…☆1,899Updated last month
- A family of header-only, very fast and memory-friendly hashmap and btree containers.☆3,146Updated 2 months ago
- `std::execution`, the proposed C++ framework for asynchronous and parallel programming.☆2,226Updated this week
- Message passing based allocator☆1,775Updated last week
- nsync is a C library that exports various synchronization primitives, such as mutexes☆1,251Updated 3 months ago
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,397Updated 3 months ago
- nanobind: tiny and efficient C++/Python bindings☆3,326Updated this week
- A fast multi-producer, multi-consumer lock-free concurrent queue for C++11☆12,037Updated 7 months ago
- SIMD Vector Classes for C++☆1,516Updated last year
- A personal experimental C++ Syntax 2 -> Syntax 1 compiler☆5,903Updated last week
- Simple, fast, accurate single-header microbenchmarking functionality for C++11/14/17/20☆1,668Updated last year
- ☆5,108Updated this week
- Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, an…☆1,649Updated this week
- ArrayFire: a general purpose GPU library.☆4,854Updated 5 months ago