marcin-osowski / cmov
Measuring cmov vs branch-mov performance
☆78Updated 6 years ago
Related projects: ⓘ
- A terminal viewer for x86 instruction/intrinsic information using Python 3 + curses☆127Updated last year
- Very low-overhead timer/counter interfaces for C on Intel 64 processors.☆116Updated 4 years ago
- Reworking of Agner Fog's performance test programs for Linux☆110Updated 5 years ago
- Tweaked version of "Aha" - "A Hacker's Assistant" superoptimiser by Henry S. Warren☆57Updated 2 years ago
- Quick sort code using AVX2 instructions☆67Updated 7 years ago
- Fast Hash Functions Using AES Intrinsics☆79Updated 5 years ago
- ☆54Updated 9 years ago
- A dynamically safe implementation of C, using your existing C compiler. Tolerates idiomatic C code pretty well. Not perfect... yet.☆100Updated last week
- Programatically obtain information about the pages backing a given memory region☆71Updated 2 years ago
- A small library and kernel module for easy access to x86 performance monitor counters under Linux.☆91Updated 4 months ago
- Experiments with array layouts for comparison-based searching☆80Updated 8 months ago
- Restartable Sequences: a userspace implementation of cheap per-cpu atomic operations☆32Updated 5 years ago
- Testing memory-level parallelism☆64Updated 6 months ago
- Random Number Generator based on hardware-accelerated AES instructions☆56Updated 5 years ago
- Create man pages from information used by Intel Intrinsics Guide and optionally uops.info☆42Updated 2 years ago
- Markup source code showing optimizations☆35Updated 4 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆57Updated last year
- Poireau: a sampling allocation debugger☆86Updated 2 years ago
- Record "perf" performance metrics for individual functions/regions of an ELF binary.☆69Updated 8 months ago
- InstLatX64_Demo☆41Updated last month
- Dynamic runtime inlining with LLVM☆65Updated 2 years ago
- ☆49Updated 6 months ago
- A small DFA for under 16 states☆52Updated 6 years ago
- Fast differential coding functions (using SIMD instructions)☆49Updated 6 years ago
- Delta Pointers: Buffer Overflow Checks Without the Checks (EuroSys'18)☆51Updated 2 years ago
- uops.info Code Analyzer☆229Updated 8 months ago
- Working draft of nextgen malloc implementation for musl libc☆116Updated 3 years ago
- fast SIMD-able JIT regular expression compiler☆189Updated 9 years ago
- A Wait-Free Universal Construct for Large Objects☆95Updated 4 years ago
- Portable Runtime System☆22Updated 8 years ago