google / highwayLinks
Performance-portable, length-agnostic SIMD with runtime dispatch
☆5,140Updated last week
Alternatives and similar repositories for highway
Users that are interested in highway are comparing it to the libraries listed below
Sorting:
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,851Updated 3 weeks ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,538Updated this week
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, MySQL, Chro…☆1,915Updated 2 weeks ago
- C++ template library for high performance SIMD based sorting algorithms☆983Updated 2 months ago
- C/C++ Performance Profiler☆4,307Updated 9 months ago
- The compiler is available for download. Get it!☆2,536Updated 2 years ago
- Vector class library, latest version☆1,406Updated last year
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,557Updated this week
- Intel® Implicit SPMD Program Compiler☆2,780Updated this week
- The book "Performance Analysis and Tuning on Modern CPU"☆3,365Updated 5 months ago
- Message passing based allocator☆1,746Updated 3 weeks ago
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,273Updated this week
- A family of header-only, very fast and memory-friendly hashmap and btree containers.☆3,080Updated 3 weeks ago
- nanobind: tiny and efficient C++/Python bindings☆3,129Updated last week
- This is an online course where you can learn and master the skill of low-level performance analysis and tuning.☆3,342Updated last week
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception ha…☆1,873Updated 2 months ago
- `std::execution`, the proposed C++ framework for asynchronous and parallel programming.☆2,090Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,456Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,309Updated last year
- A personal experimental C++ Syntax 2 -> Syntax 1 compiler☆5,832Updated last month
- A hybrid thread / fiber task scheduler written in C++ 11☆1,982Updated 8 months ago
- An efficient C++17 GPU numerical computing library with Python-like syntax☆1,360Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,426Updated this week
- SIMD Vector Classes for C++☆1,512Updated last year
- mimalloc is a compact general purpose allocator with excellent performance.☆12,144Updated last week
- Compile Time Regular Expression in C++☆3,699Updated 2 months ago
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,361Updated 3 weeks ago
- A memory allocator that automatically reduces the memory footprint of C/C++ applications.☆1,833Updated last year
- Extremely fast, in memory, JSON and reflection library for modern C++☆2,200Updated last week
- nsync is a C library that exports various synchronization primitives, such as mutexes☆1,234Updated 2 weeks ago