google / highway
Performance-portable, length-agnostic SIMD with runtime dispatch
☆4,202 Updated this week
Related projects
Alternatives and complementary repositories for highway
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R… ☆1,573 Updated last week
- C/C++ Performance Profiler ☆4,210 Updated this week
- mimalloc is a compact general purpose allocator with excellent performance. ☆10,568 Updated this week
- Implementations of SIMD instruction sets for systems which don't natively support them. ☆2,387 Updated last month
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl ☆2,293 Updated 9 months ago
- Vector class library, latest version ☆1,302 Updated 9 months ago
- Message passing based allocator ☆1,574 Updated last week
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C ☆2,156 Updated 4 months ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE) ☆2,198 Updated last week
- The compiler is available for download. Get it! ☆2,385 Updated last year
- A microbenchmark support library ☆9,020 Updated this week
- C++ Insights - See your source code with the eyes of a compiler ☆4,096 Updated 2 weeks ago
- A heap memory profiler for Linux ☆3,329 Updated 3 weeks ago
- C++ template library for high performance SIMD based sorting algorithms ☆878 Updated this week
- Intel® Implicit SPMD Program Compiler ☆2,516 Updated this week
- A family of header-only, very fast and memory-friendly hashmap and btree containers. ☆2,540 Updated last week
- A memory allocator that automatically reduces the memory footprint of C/C++ applications. ☆1,753 Updated 4 months ago
- A hybrid thread / fiber task scheduler written in C++ 11 ☆1,878 Updated 3 months ago
- The book "Performance Analysis and Tuning on Modern CPU" ☆2,147 Updated this week
- Binary Optimization and Layout Tool - A Linux command-line utility used for optimizing performance of binaries ☆2,516 Updated last year
- oneAPI Threading Building Blocks (oneTBB) ☆5,717 Updated this week
- Frame profiler ☆10,156 Updated this week
- Low-latency machine code generation ☆3,958 Updated 2 weeks ago
- A cross platform C99 library to get cpu features at runtime. ☆2,460 Updated this week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr ☆961 Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit. ☆2,831 Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl ☆4,919 Updated 9 months ago
- Mold: A Modern Linker 🦠 ☆14,344 Updated last week
- A personal experimental C++ Syntax 2 -> Syntax 1 compiler ☆5,523 Updated this week
- A tool for use with clang to analyze #includes in C and C++ source files ☆4,115 Updated this week