dcompiler / loca
Program locality analysis tools
☆18Updated 6 years ago
Alternatives and similar repositories for loca
Users that are interested in loca are comparing it to the libraries listed below
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 6 years ago
- ☆31Updated last week
- Creating beautiful gem5 simulations☆49Updated 4 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"☆28Updated 3 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆31Updated last year
- ☆19Updated 3 years ago
- The Splash-3 benchmark suite☆45Updated 2 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆43Updated 4 years ago
- (elastic) cuckoo hashing☆15Updated 5 years ago
- ☆40Updated 3 years ago
- CUDAAdvisor: a GPU profiling tool☆51Updated 7 years ago
- Artifact, reproducibility, and testing utilities for gem5☆23Updated 4 years ago
- GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators☆34Updated 3 years ago
- Multiple approaches to statistical simulation for computer architects☆15Updated 5 years ago
- ☆68Updated 6 years ago
- ☆34Updated 3 years ago
- A Multiplatform benchmark designed to provide holistic, detailed and close-to-hardware view of memory system performance with family of b…☆44Updated 3 months ago
- VASim is a virtual homogeneous non-deterministic finite automata simulator and transformation tool. VASim can parse, transform, …☆36Updated last year
- Race detector for NVIDIA GPUs, published in SOSP 2021.☆18Updated 11 months ago
- Characterizing and Modeling Non-Volatile Memory Systems [MICRO'20, TopPicks'21]☆32Updated 4 years ago
- A fast, accurate, and easy-to-integrate memory simulator that models memory system performance with bandwidth-latency curves.☆31Updated 3 months ago
- Interprocedural Basic Block Code Layout Optimization☆18Updated 7 years ago
- A fast and scalable x86-64 multicore simulator☆31Updated 4 years ago
- A Pin tool for collecting microarchitecture-independent workload characteristics☆62Updated last year
- Haystack is an analytical cache model that, given a program, computes the number of cache misses.☆46Updated 6 years ago
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 …☆23Updated 5 years ago
- TLB Benchmarks☆35Updated 8 years ago
- Architecture-level Fault Injection Tool for GPU Application Resilience Evaluation☆80Updated 2 years ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆50Updated 7 years ago
- A Comprehensive Benchmark Suite for Graph Computing☆70Updated 6 years ago