Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"
☆808May 3, 2024Updated 2 years ago
Alternatives and similar repositories for optimization-manual
Users that are interested in optimization-manual are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆383May 11, 2026Updated last week
- A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.☆515Mar 29, 2026Updated last month
- C++ template library for high performance SIMD based sorting algorithms☆1,011Mar 14, 2026Updated 2 months ago
- Intel PMU profiling tools☆2,225Apr 28, 2026Updated 3 weeks ago
- A benchmark for low-level CPU micro-architectural features☆767Feb 8, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tools and Reference Code for Intel Optimizations (eg Large Pages)☆147Sep 8, 2025Updated 8 months ago
- Intel® Performance Counter Monitor (Intel® PCM)☆3,271Apr 30, 2026Updated 2 weeks ago
- Vector class library, latest version☆1,452Apr 14, 2026Updated last month
- User space software for Intel(R) Resource Director Technology☆752Apr 28, 2026Updated 3 weeks ago
- The book "Performance Analysis and Tuning on Modern CPU"☆3,541Jun 9, 2025Updated 11 months ago
- The X86 Encoder Decoder (XED), is a software library for encoding and decoding X86 (IA32 and Intel64) instructions☆1,588Mar 19, 2026Updated 2 months ago
- A JIT assembler for x86/x64 architectures supporting FPU, MMX, SSE (1-4), AVX (1-2, 512), APX, and AVX10.2☆2,235Updated this week
- ☆38Jun 26, 2024Updated last year
- Device Tree-based Platform Device Driver Development for Tiano UEFI☆16Sep 8, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- libipt - an Intel(R) Processor Trace decoder library☆725Jan 8, 2026Updated 4 months ago
- Performance-portable, length-agnostic SIMD with runtime dispatch☆5,507Updated this week
- Intel® Implicit SPMD Program Compiler☆2,877May 12, 2026Updated last week
- ☆236Aug 4, 2022Updated 3 years ago
- Software artifacts for "UC-Check: Characterizing Micro-operation Caches in x86 Processors and Implications in Security and Performance" (…☆10Dec 27, 2021Updated 4 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆59Jan 7, 2023Updated 3 years ago
- OpenDCDiag is an open-source project designed to identify defects and bugs in CPUs. It consists of a set of tests built around a sophisti…☆75Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,644Updated this week
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆146Apr 14, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Open-source Linux performance suite for engineers—profiling and tuning workloads and system configurations.☆447Updated this week
- ☆44Jul 19, 2023Updated 2 years ago
- uops.info Code Analyzer☆344Jan 14, 2024Updated 2 years ago
- This is an online course where you can learn and master the skill of low-level performance analysis and tuning.☆3,707Updated this week
- A cross-platform x86 assembler with an Intel-like syntax☆3,195Apr 22, 2026Updated 3 weeks ago
- A microbenchmark support library☆10,203Updated this week
- ROB size testing utility☆162Dec 19, 2021Updated 4 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts☆229Oct 28, 2024Updated last year
- Experimental imgui app framework for rapid prototyping.☆14Aug 10, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Notes on optimizing the linux kernel function csum_partial☆14Nov 28, 2021Updated 4 years ago
- Measures the latency between CPU cores☆1,349Mar 25, 2026Updated last month
- mimalloc is a compact general purpose allocator with excellent performance.☆12,894May 8, 2026Updated last week
- This repository contains high-performance implementations of memset and memcpy in assembly.☆343Jan 10, 2022Updated 4 years ago
- Intel(R) Multi-Buffer Crypto for IPSec☆333Apr 27, 2026Updated 3 weeks ago
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,334Updated this week
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)☆1,173Apr 30, 2026Updated 2 weeks ago