sslotin / amh-codeLinks
Complete implementations from "Algorithms for Modern Hardware"
☆793Updated 3 years ago
Alternatives and similar repositories for amh-code
Users that are interested in amh-code are comparing it to the libraries listed below
Sorting:
- C++ template library for high performance SIMD based sorting algorithms☆996Updated 4 months ago
- A collection of lock-free data structures written in standard C++11☆953Updated last year
- This is a repository for the code posted on my blog☆914Updated last week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,282Updated last week
- A curated list of awesome SIMD frameworks, libraries and software☆233Updated last year
- Vector class library, latest version☆1,429Updated last year
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception ha…☆1,898Updated last month
- ☆147Updated 2 months ago
- Slides and other materials from CppCon 2022☆561Updated 4 months ago
- Slides and other materials from CppCon 2023☆336Updated last year
- This repository contains high-performance implementations of memset and memcpy in assembly.☆341Updated 4 years ago
- This is an online course where you can learn and master the skill of low-level performance analysis and tuning.☆3,529Updated last week
- Storage for my snippets, toy programs, etc.☆373Updated 2 months ago
- A general-purpose high-performance lightweight STL-like modern C++ B-Tree☆246Updated last year
- std::simd for GCC [ISO/IEC TS 19570:2018]☆637Updated 2 years ago
- ☆45Updated last year
- A Toolkit for Programming Parallel Algorithms on Shared-Memory Multicore Machines☆418Updated 2 months ago
- Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, MySQL, Chro…☆1,977Updated this week
- Reference implementation of Dragonbox in C++☆790Updated 3 months ago
- The book "Performance Analysis and Tuning on Modern CPU"☆3,453Updated 7 months ago
- The Art of Writing Efficient Programs, published by Packt☆362Updated last year
- "See why!" Explains and suggests fixes for compile-time errors for C, C++, C#, Go, Java, LaTeX, PHP, Python, Ruby, Rust, and TypeScript☆304Updated 2 months ago
- The project provides high-performance concurrency, enabling highly parallel computation.☆231Updated 3 weeks ago
- A curated list of awesome parallel computing resources☆778Updated 2 weeks ago
- Performance Tuning Tutorial given at Oak Ridge National Laboratory☆184Updated 4 years ago
- A fast, small C/C++ function call tracer for x86-64/Linux, supports clang & gcc, ftrace, threads, exceptions & shared libraries☆193Updated 10 months ago
- Official git repository for libdivide: optimized integer division☆1,287Updated last month
- A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines☆804Updated this week
- Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extens…☆1,656Updated this week
- Lightweight structured concurrency for C++20☆157Updated last month