google / highway
Performance-portable, length-agnostic SIMD with runtime dispatch
☆4,409Updated this week
Alternatives and similar repositories for highway:
Users that are interested in highway are comparing it to the libraries listed below
- C++ template library for high performance SIMD based sorting algorithms☆914Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,297Updated last year
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, R…☆1,701Updated last week
- C/C++ Performance Profiler☆4,262Updated 3 weeks ago
- Implementations of SIMD instruction sets for systems which don't natively support them.☆2,531Updated last week
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,309Updated this week
- mimalloc is a compact general purpose allocator with excellent performance.☆11,004Updated this week
- Vector class library, latest version☆1,337Updated last year
- A family of header-only, very fast and memory-friendly hashmap and btree containers.☆2,842Updated last week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆2,998Updated this week
- C++ Insights - See your source code with the eyes of a compiler☆4,193Updated 3 weeks ago
- The compiler is available for download. Get it!☆2,479Updated last year
- oneAPI Threading Building Blocks (oneTBB)☆5,947Updated this week
- A hybrid thread / fiber task scheduler written in C++ 11☆1,916Updated this week
- A memory allocator that automatically reduces the memory footprint of C/C++ applications.☆1,791Updated 7 months ago
- nanobind: tiny and efficient C++/Python bindings☆2,597Updated this week
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,287Updated this week
- `std::execution`, the proposed C++ framework for asynchronous and parallel programming.☆1,763Updated this week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆2,089Updated this week
- Message passing based allocator☆1,624Updated last week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,144Updated this week
- Mold: A Modern Linker 🦠☆14,859Updated this week
- Compile Time Regular Expression in C++☆3,472Updated last week
- Public domain cross platform lock free thread caching 16-byte aligned memory allocator implemented in C☆2,230Updated last month
- Abseil Common Libraries (C++)☆15,474Updated this week
- A fast multi-producer, multi-consumer lock-free concurrent queue for C++11☆10,662Updated 3 weeks ago
- Frame profiler☆10,790Updated this week
- A microbenchmark support library☆9,265Updated this week
- A General-purpose Task-parallel Programming System using Modern C++☆10,562Updated this week
- Official git repository for libdivide: optimized integer division☆1,148Updated 3 weeks ago