google / highwayLinks
Performance-portable, length-agnostic SIMD with runtime dispatch
☆5,011Updated this week
Alternatives and similar repositories for highway
Users that are interested in highway are comparing it to the libraries listed below
Sorting:
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,792Updated this week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,477Updated 2 weeks ago
- C++ template library for high performance SIMD based sorting algorithms☆962Updated this week
- C/C++ Performance Profiler☆4,310Updated 7 months ago
- Vector class library, latest version☆1,394Updated last year
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R…☆1,868Updated last week
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,505Updated last week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,242Updated this week
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception ha…☆1,854Updated 3 weeks ago
- mimalloc is a compact general purpose allocator with excellent performance.☆11,915Updated last week
- C++ Insights - See your source code with the eyes of a compiler☆4,380Updated 2 months ago
- The book "Performance Analysis and Tuning on Modern CPU"☆3,304Updated 3 months ago
- A family of header-only, very fast and memory-friendly hashmap and btree containers.☆3,052Updated last week
- Intel® Implicit SPMD Program Compiler☆2,744Updated last week
- Message passing based allocator☆1,728Updated 3 weeks ago
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,310Updated last year
- `std::execution`, the proposed C++ framework for asynchronous and parallel programming.☆2,016Updated this week
- This is an online course where you can learn and master the skill of low-level performance analysis and tuning.☆3,243Updated last week
- The compiler is available for download. Get it!☆2,528Updated last year
- SIMD Vector Classes for C++☆1,505Updated last year
- Frame profiler☆12,713Updated last week
- The Linux perf GUI for performance analysis.☆4,839Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,325Updated this week
- nsync is a C library that exports various synchronization primitives, such as mutexes☆1,208Updated this week
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,691Updated last week
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆754Updated last week
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,342Updated last month
- A memory allocator that automatically reduces the memory footprint of C/C++ applications.☆1,826Updated last year
- A hybrid thread / fiber task scheduler written in C++ 11☆1,967Updated 6 months ago
- Compile Time Regular Expression in C++☆3,652Updated 3 months ago