google / highway
Performance-portable, length-agnostic SIMD with runtime dispatch
☆4,590Updated this week
Alternatives and similar repositories for highway:
Users that are interested in highway are comparing it to the libraries listed below
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,653Updated last month
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,364Updated this week
- C/C++ Performance Profiler☆4,283Updated 3 months ago
- C++ template library for high performance SIMD based sorting algorithms☆929Updated last week
- The compiler is available for download. Get it!☆2,500Updated last year
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,299Updated last year
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R…☆1,763Updated last month
- mimalloc is a compact general purpose allocator with excellent performance.☆11,347Updated this week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,188Updated this week
- Extremely fast, in memory, JSON and interface library for modern C++☆1,827Updated this week
- C++ Insights - See your source code with the eyes of a compiler☆4,256Updated last month
- Vector class library, latest version☆1,359Updated last year
- A family of header-only, very fast and memory-friendly hashmap and btree containers.☆2,936Updated 3 weeks ago
- Message passing based allocator☆1,664Updated last week
- `std::execution`, the proposed C++ framework for asynchronous and parallel programming.☆1,879Updated this week
- A memory allocator that automatically reduces the memory footprint of C/C++ applications.☆1,811Updated 10 months ago
- A hybrid thread / fiber task scheduler written in C++ 11☆1,934Updated 2 months ago
- Compile Time Regular Expression in C++☆3,538Updated 3 weeks ago
- A personal experimental C++ Syntax 2 -> Syntax 1 compiler☆5,718Updated last week
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,361Updated this week
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,274Updated last month
- SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT☆724Updated 2 weeks ago
- CUDA Core Compute Libraries☆1,626Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntax☆1,321Updated this week
- Intel® Implicit SPMD Program Compiler☆2,655Updated last week
- A fast single-producer, single-consumer lock-free queue for C++☆4,083Updated last week
- 📦 CMake's missing package manager. A small CMake script for setup-free, cross-platform, reproducible dependency management.☆3,404Updated this week
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,616Updated this week
- Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, an…☆1,362Updated 2 weeks ago
- A fast multi-producer, multi-consumer lock-free concurrent queue for C++11☆10,938Updated last week