preshing / analyze-spec-benchmarks
☆40Updated 7 years ago
Alternatives and similar repositories for analyze-spec-benchmarks:
Users that are interested in analyze-spec-benchmarks are comparing it to the libraries listed below
- A parallel, distributed simulator for multicores.☆181Updated 9 years ago
- pmu event analysis package☆76Updated last year
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.☆98Updated 14 years ago
- A fast and highly scalable GPU dynamic memory allocator☆104Updated 10 years ago
- A Benchmark Suite for Heterogeneous System Computation☆53Updated last month
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- assembler for NVIDIA FERMI. Imported from Google Code☆72Updated 10 years ago
- Measure instruction latency and throughput☆23Updated last month
- ☆75Updated last year
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- NUMAPROF is a NUMA memory profliler based on Pintool to track your remote memory accesses.☆46Updated 9 months ago
- Clustered/Stacked Filled Bar Graph Generator☆34Updated 8 years ago
- Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce frameworks, such as MR-MPI,…☆21Updated 6 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit☆92Updated 3 weeks ago
- FROZEN: the master branch has merged with the libfabric git repo☆31Updated 6 years ago
- A Shared Memory Multithreaded Graph Benchmark Suite for Multicores☆35Updated 2 years ago
- Flexible GPGPU instrumentation☆86Updated 5 years ago
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- CERE: Codelet Extractor and REplayer☆40Updated last year
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆36Updated 3 years ago
- Emulating DMA Engines on GPUs for Performance and Portability☆38Updated 9 years ago
- A domain-specific language and compiler for image processing☆76Updated 4 years ago
- RV: A Unified Region Vectorizer for LLVM☆107Updated 2 months ago
- Compute applications.☆24Updated 5 years ago
- A GPU Database☆147Updated 7 years ago
- ☆34Updated 3 years ago
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated 2 years ago
- an assembler/compiler for AMD’s GCN (Generation Core Next Architecture) Assembly Language☆40Updated 2 years ago
- A tool for measuring the cache-coherence latencies of a processor (i.e., the latencies of loads, stores, CAS, FAI, TAS, and SWAP).☆76Updated 3 years ago