upenn-acg / ocolos-public
Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.
☆52Updated last year
Alternatives and similar repositories for ocolos-public:
Users that are interested in ocolos-public are comparing it to the libraries listed below
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago
- ☆53Updated 5 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆28Updated 6 months ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆56Updated 5 months ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- ☆28Updated 2 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code☆28Updated 6 years ago
- DMon Prototype for OSDI 2021 Artifact Evaluation☆22Updated 3 years ago
- A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more☆130Updated last week
- Slice-aware Memory Management - Exploiting NUCA Characteristic of LLC in Intel Processors☆40Updated 5 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 11 years ago
- compiling DSLs to high-level hardware instructions☆22Updated 2 years ago
- Stencil Probe - a stencil microbenchmark☆30Updated 12 years ago
- User-space Page Management☆108Updated 7 months ago
- A GPU FP32 computation method with Tensor Cores.☆20Updated 2 years ago
- Universal Presentation: A Header-only C++ Library to Cout STL containers and more☆19Updated last year
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆16Updated 2 years ago
- Characterizing and Modeling Non-Volatile Memory Systems [MICRO'20, TopPicks'21]☆33Updated 3 years ago
- A host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data po…☆60Updated 4 years ago
- Skyloft: A General High-Efficient Scheduling Framework in User Space (SOSP 2024)☆33Updated 6 months ago
- Source code for the paper "Profile Guided Optimization without Profiles: A Machine Learning Approach"☆23Updated 3 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆41Updated 2 months ago
- Repo for OSDI 2023 paper: "Ship your Critical Section Not Your Data: Enabling Transparent Delegation with TCLocks"☆14Updated 4 months ago
- ☆21Updated 2 years ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆74Updated last year
- The Splash-3 benchmark suite☆43Updated last year
- A compiler to automatically transform applications into disaggregated memory apps.☆16Updated last year
- Interprocedural Basic Block Code Layout Optimization☆18Updated 6 years ago
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.☆21Updated 4 years ago