nadavrot / memset_benchmark
This repository contains high-performance implementations of memset and memcpy in assembly.
☆341 Updated 4 years ago
Alternatives and similar repositories for memset_benchmark
Users who are interested in memset_benchmark compare it to the libraries listed below.
- Storage for my snippets, toy programs, etc. ☆373 Updated 2 months ago
- Heap Layers: An Extensible Memory Allocation Infrastructure ☆411 Updated last month
- uops.info Code Analyzer ☆325 Updated 2 years ago
- Suite for benchmarking malloc implementations. ☆471 Updated last week
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses ☆128 Updated 3 years ago
- A fast, small C/C++ function call tracer for x86-64/Linux, supports clang & gcc, ftrace, threads, exceptions & shared libraries ☆193 Updated 10 months ago
- A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware. ☆340 Updated last year
- Single-header C11 port of https://abseil.io SwissTable ☆169 Updated 3 years ago
- Very low-overhead timer/counter interfaces for C on Intel 64 processors. ☆140 Updated 3 months ago
- A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs. ☆506 Updated this week
- SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification ☆261 Updated 4 years ago
- A benchmark for low-level CPU micro-architectural features ☆760 Updated 3 years ago
- This is a repository for the code posted on my blog ☆914 Updated last week
- Object Introspection (OI) enables on-demand, hierarchical profiling of objects in arbitrary C/C++ programs with no recompilation. ☆181 Updated last week
- UMASH: a fast enough hash and fingerprint with collision bounds ☆159 Updated 3 weeks ago
- A small study in hardware accelerated AoS reversal ☆179 Updated 7 years ago
- PROPELLER: Profile Guided Optimizing Large Scale LLVM-based Relinker ☆497 Updated this week
- Fastest CRC32 for x86, Intel and AMD, + comprehensive derivation and discussion of various approaches ☆333 Updated 4 years ago
- Reference implementation of Dragonbox in C++ ☆791 Updated 3 months ago
- ☆205 Updated 3 years ago
- ☆296 Updated last year
- Different algorithms for converting binary to decimal floating-point numbers ☆128 Updated 3 years ago
- SIMD-enhanced word counter ☆253 Updated 6 years ago
- Testing memory-level parallelism ☆82 Updated last year
- Query C++ codebases using SQLite ☆188 Updated 3 years ago
- Open Source Architecture Code Analyzer ☆348 Updated last week
- C++ template library for high performance SIMD based sorting algorithms ☆996 Updated this week
- Bitset Sort, a faster std::sort replacement. ☆79 Updated 4 years ago
- Code for paper "Base64 encoding and decoding at almost the speed of a memory copy" ☆205 Updated 6 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts ☆230 Updated last year