upenn-acg / ocolos-public
Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.
☆52Updated last year
Alternatives and similar repositories for ocolos-public:
Users that are interested in ocolos-public are comparing it to the libraries listed below
- ☆28Updated 2 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆28Updated 6 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆45Updated 5 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 11 years ago
- Monorepo for the OpenCilk compiler. Forked from llvm/llvm-project and based on Tapir/LLVM.☆101Updated last week
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 5 months ago
- Collaborative Parallelization Framework (CPF)☆32Updated last year
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆56Updated 3 months ago
- A compiler to automatically transform applications into disaggregated memory apps.☆15Updated last year
- Source code for the paper "Profile Guided Optimization without Profiles: A Machine Learning Approach"☆24Updated 3 years ago
- ☆52Updated 5 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆40Updated last month
- Race detector for NVIDIA GPUs, published in SOSP 2021.☆19Updated 8 months ago
- Updated C version of the Test Suite for Vectorising Compilers☆55Updated 11 months ago
- A false sharing detection and repair tool☆13Updated 5 years ago
- ☆34Updated 3 years ago
- Bridging polyhedral analysis tools to the MLIR framework☆107Updated last year
- compiling DSLs to high-level hardware instructions☆22Updated 2 years ago
- Interprocedural Basic Block Code Layout Optimization☆18Updated 6 years ago
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆129Updated last week
- User-space Page Management☆106Updated 6 months ago
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆16Updated 2 years ago
- CPU micro benchmarks☆45Updated 3 weeks ago
- The Splash-3 benchmark suite☆42Updated last year
- DMon Prototype for OSDI 2021 Artifact Evaluation☆21Updated 3 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆81Updated last year
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆109Updated 2 years ago
- Stencil Probe - a stencil microbenchmark☆30Updated 12 years ago