ROCm / omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
☆299Updated this week
Related projects ⓘ
Alternatives and complementary repositories for omnitrace
- Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template…☆353Updated 3 months ago
- High-level C++ for Accelerator Clusters☆142Updated this week
- Advanced Profiling and Analytics for AMD Hardware☆135Updated this week
- Reference implementation of mdspan targeting C++23☆406Updated last month
- Clang with JIT extensions☆229Updated last year
- "See why!" Explains and suggests fixes for compile-time errors for C, C++, C#, Go, Java, LaTeX, PHP, Python, Ruby, Rust, and TypeScript☆277Updated last month
- Expressive Vector Engine - SIMD in C++ Goes Brrrr☆961Updated this week
- An implementation of HIP that works on CPUs, across OSes.☆112Updated 7 months ago
- LLVM Optimization to extract a function, embedded in its intermediate representation in the binary, and execute it using the LLVM Just-In…☆513Updated 3 years ago
- Caliper is an instrumentation and performance profiling library☆350Updated last week
- Companion Repository for the Lecture Slides for the Clang Libraries☆88Updated 8 months ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆517Updated 5 months ago
- Comprehensive benchmarks of C++ maps☆299Updated last year
- SYCL Open Source Specification☆114Updated this week
- C++20 [Minimal] Static Perfect Hash library☆176Updated last month
- A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines☆609Updated 3 weeks ago
- Agenium Scale vectorization library for CPUs and GPUs☆326Updated 3 years ago
- uops.info Code Analyzer☆237Updated 9 months ago
- A simple, extensible, portable, efficient and header-only SIMD library!☆229Updated 3 years ago
- std::simd for GCC [ISO/IEC TS 19570:2018]☆578Updated last year
- An implementation of BLAS using the SYCL open standard.☆259Updated last week
- C++20 Meta-Programming library☆243Updated 2 months ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆46Updated this week
- The Berkeley Container Library☆120Updated last year
- ☆68Updated 4 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- C++ zero-cost abstraction for SoA/AoS memory layouts☆184Updated 2 years ago
- CUDA kernel author's tools☆107Updated 2 years ago
- Convert .ninja_log files to chrome's about:tracing format.☆419Updated 5 months ago
- Concurrent Deferred Reference Counting☆149Updated 8 months ago