Performance Tuning Tutorial given at Oak Ridge National Laboratory
☆184May 19, 2021Updated 4 years ago
Alternatives and similar repositories for performance_tuning_tutorial
Users that are interested in performance_tuning_tutorial are comparing it to the libraries listed below
Sorting:
- Tracing-based reverse mode automatic differentiation (like autograd!)☆27Feb 2, 2025Updated last year
- The book "Performance Analysis and Tuning on Modern CPU"☆3,503Jun 9, 2025Updated 9 months ago
- Intel processor trace tools for analyzing performance of function☆108Aug 14, 2025Updated 7 months ago
- STL-compliant stable vector container☆33Oct 21, 2018Updated 7 years ago
- memory access workload simulator☆36Jan 26, 2026Updated last month
- Hijack Linux kernel syscall with f-stack api.☆19Aug 9, 2017Updated 8 years ago
- This is an online course where you can learn and master the skill of low-level performance analysis and tuning.☆3,576Mar 10, 2026Updated last week
- Install a hardware breakpoint in Linux kernel for tracing/debugging☆26Apr 20, 2025Updated 11 months ago
- Articles on various software desing and development topics, with accent on the contamporary C++☆277Sep 3, 2025Updated 6 months ago
- ☆148May 21, 2025Updated 9 months ago
- Track memory leaks for Linux kernel modules using eBPF☆46Mar 11, 2026Updated last week
- FlameGraphs in Your App☆34Jan 2, 2025Updated last year
- Latency Debug compatible LLVM compiler based on LLVM 14☆17Apr 15, 2024Updated last year
- Last Writer Slicing: data provenance tracking for concurrent program debugging & analysis☆13Nov 14, 2014Updated 11 years ago
- A minimal (really) out-of-tree MLIR example☆47Aug 14, 2025Updated 7 months ago
- My personal work on the numerical projects of a book called "A First Course in Stochastic Calculus".☆16Apr 29, 2022Updated 3 years ago
- Clone of libtraceevent from kernel.org☆20Feb 4, 2026Updated last month
- Intel PMU profiling tools☆2,217Mar 3, 2026Updated 2 weeks ago
- Making FB's flashcache to cache a group of disks with a single SSD☆35Aug 29, 2014Updated 11 years ago
- Open-source Linux performance suite for engineers—profiling and tuning workloads and system configurations.☆439Mar 13, 2026Updated last week
- A single producer single consumer lock free queue that utilizes copy / move assignment to transfer messages. Achieves a top performance, …☆89Nov 30, 2025Updated 3 months ago
- ☆16Feb 29, 2020Updated 6 years ago
- The Linux perf GUI for performance analysis.☆5,007Mar 10, 2026Updated last week
- Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception ha…☆1,901Dec 23, 2025Updated 2 months ago
- Finite State Machine in C++☆14May 2, 2024Updated last year
- Order Book implementation in C++20 (Concepts & Co-Routines)☆30May 14, 2024Updated last year
- HeraclesQL is a Python DSL for writing alerts!☆26Dec 3, 2025Updated 3 months ago
- SQLStorm: Taking Database Benchmarking into the LLM Era☆78Jan 2, 2026Updated 2 months ago
- Demos for blog post☆13Sep 28, 2025Updated 5 months ago
- ☆38Feb 19, 2026Updated last month
- Cheap: customized heaps for improved application performance.☆28Oct 11, 2022Updated 3 years ago
- magic-trace collects and displays high-resolution traces of what a process is doing☆5,267Updated this week
- MoSAIC: Modular system for Acceleration Integration MoSAIC☆10Aug 22, 2025Updated 6 months ago
- DARMA/magistrate => Serialization and checkpointing library☆12Jan 26, 2026Updated last month
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo☆12Feb 12, 2023Updated 3 years ago
- ☆14Mar 18, 2025Updated last year
- Reference Implementation for stdBLAS☆157Mar 12, 2026Updated last week
- Demo repository for all the different ways to do eBPF Tracing☆18Feb 9, 2026Updated last month
- ☆16Nov 28, 2024Updated last year