Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"
☆812May 3, 2024Updated 2 years ago
Alternatives and similar repositories for optimization-manual
Users who are interested in optimization-manual are comparing it to the libraries listed below.
- ☆392Jun 22, 2026Updated last week
- A tool for running small microbenchmarks on recent Intel and AMD x86 CPUs.☆523Mar 29, 2026Updated 3 months ago
- C++ template library for high performance SIMD based sorting algorithms☆1,012Updated this week
- Intel PMU profiling tools☆2,233Jun 17, 2026Updated last week
- A benchmark for low-level CPU micro-architectural features☆770Feb 8, 2022Updated 4 years ago
- Tools and Reference Code for Intel Optimizations (eg Large Pages)☆147Sep 8, 2025Updated 9 months ago
- Intel® Performance Counter Monitor (Intel® PCM)☆3,290Jun 19, 2026Updated last week
- Vector class library, latest version☆1,457Apr 14, 2026Updated 2 months ago
- User space software for Intel(R) Resource Director Technology☆750Updated this week
- The book "Performance Analysis and Tuning on Modern CPU"☆3,576Jun 9, 2025Updated last year
- The X86 Encoder Decoder (XED), is a software library for encoding and decoding X86 (IA32 and Intel64) instructions☆1,597May 20, 2026Updated last month
- A JIT assembler for x86/x64 architectures supporting FPU, MMX, SSE (1-4), AVX (1-2, 512), APX, and AVX10.2☆2,251Jun 19, 2026Updated last week
- ☆38Jun 26, 2024Updated 2 years ago
- libipt - an Intel(R) Processor Trace decoder library☆730May 19, 2026Updated last month
- Performance-portable, length-agnostic SIMD with runtime dispatch☆5,644Updated this week
- Intel® Implicit SPMD Program Compiler☆2,912Updated this week
- ☆237Aug 4, 2022Updated 3 years ago
- Software artifacts for "UC-Check: Characterizing Micro-operation Caches in x86 Processors and Implications in Security and Performance" (…☆10Dec 27, 2021Updated 4 years ago
- immintrin_dbg.h is an include file, a wrapper around immintrin.h. It implements most of AVX, AVX2, AVX-512 vector intrinsics to enable so…☆59Jan 7, 2023Updated 3 years ago
- OpenDCDiag is an open-source project designed to identify defects and bugs in CPUs. It consists of a set of tests built around a sophisti…☆76Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,680Updated this week
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆146Apr 14, 2026Updated 2 months ago
- Open-source Linux performance suite for engineers—profiling and tuning workloads and system configurations.☆450Jun 22, 2026Updated last week
- ☆44Jul 19, 2023Updated 2 years ago
- uops.info Code Analyzer☆351Jan 14, 2024Updated 2 years ago
- This is an online course where you can learn and master the skill of low-level performance analysis and tuning.☆3,748Jun 17, 2026Updated last week
- A cross-platform x86 assembler with an Intel-like syntax☆3,243Jun 21, 2026Updated last week
- A microbenchmark support library☆10,249Updated this week
- ROB size testing utility☆163Dec 19, 2021Updated 4 years ago
- Test the non-AVX, AVX2 and AVX-512 speeds across various active core counts☆229Oct 28, 2024Updated last year
- Experimental imgui app framework for rapid prototyping.☆14Aug 10, 2025Updated 10 months ago
- Measures the latency between CPU cores☆1,356Mar 25, 2026Updated 3 months ago
- mimalloc is a compact general purpose allocator with excellent performance.☆13,113Jun 22, 2026Updated last week
- This repository contains high-performance implementations of memset and memcpy in assembly.☆342Jan 10, 2022Updated 4 years ago
- Intel(R) Multi-Buffer Crypto for IPSec☆337Updated this week
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆1,344Jun 21, 2026Updated last week
- CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)☆1,175Jun 17, 2026Updated last week
- Copy of instlatx64.atw.hu☆254May 17, 2026Updated last month
- mold: A Modern Linker 🦠☆16,625Jun 16, 2026Updated last week