upenn-acg / ocolos-public
Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.
☆52 Updated last year
Alternatives and similar repositories for ocolos-public:
Users who are interested in ocolos-public are comparing it to the repositories listed below.
- ☆28 Updated 2 years ago
- CUDAAdvisor: a GPU profiling tool ☆48 Updated 6 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code ☆28 Updated 6 years ago
- The Splash-3 benchmark suite ☆43 Updated last year
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK. ☆38 Updated 3 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses. ☆46 Updated 5 years ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc… ☆56 Updated 4 months ago
- DMon Prototype for OSDI 2021 Artifact Evaluation ☆22 Updated 3 years ago
- compiling DSLs to high-level hardware instructions ☆22 Updated 2 years ago
- ☆52 Updated 5 years ago
- the Stanford Transactional Applications for Multi-Processing; a benchmark suite for transactional memory research ☆42 Updated 3 years ago
- Bridging polyhedral analysis tools to the MLIR framework ☆109 Updated last year
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more ☆129 Updated 3 weeks ago
- A compiler to automatically transform applications into disaggregated memory apps. ☆15 Updated last year
- Slice-aware Memory Management - Exploiting NUCA Characteristic of LLC in Intel Processors ☆39 Updated 5 years ago
- Source code for the paper "Profile Guided Optimization without Profiles: A Machine Learning Approach" ☆23 Updated 3 years ago
- User-space Page Management ☆108 Updated 7 months ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code). ☆59 Updated 11 years ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs ☆74 Updated last year
- InstLatX64_Demo ☆42 Updated 2 weeks ago
- Tutorial for LLVM Dev Conference 2019. ☆15 Updated 5 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆28 Updated 5 months ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git) ☆123 Updated 2 years ago
- CPU micro benchmarks ☆47 Updated last month
- A framework that helps implementing swizzle GPU kernels ☆42 Updated 5 years ago
- ROB size testing utility ☆144 Updated 3 years ago
- CCProf: Lightweight Detection of Cache Conflicts ☆26 Updated 3 years ago
- Interprocedural Basic Block Code Layout Optimization ☆18 Updated 6 years ago
- A GPU FP32 computation method with Tensor Cores. ☆20 Updated 2 years ago
- MemLiner is a remote-memory-friendly runtime system. ☆32 Updated 2 years ago