TauferLab / MimirLinks
Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce frameworks, such as MR-MPI, while redesigning the execution model to incorporate a number of sophisticated optimization techniques that achieve similar or better performance with significant reduction in the amount of memory used.
☆21Updated 7 years ago
Alternatives and similar repositories for Mimir
Users that are interested in Mimir are comparing it to the libraries listed below
Sorting:
- Graph500 reference implementations☆181Updated 3 years ago
- Parallel Memory Bandwidth Measurement / Benchmark Tool☆115Updated 3 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆38Updated 6 years ago
- an API and runtime environment for data processing with MapReduce for shared-memory multi-core & multiprocessor systems.☆97Updated last year
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- OFI Programmer's Guide☆52Updated 3 years ago
- GPUfs - File system support for NVIDIA GPUs☆99Updated 7 years ago
- MapReduce for multi-core☆50Updated 12 years ago
- A NUMA-aware Graph-structured Analytics Framework☆44Updated 7 years ago
- High-performance graph processing on hybrid CPU-GPU platforms by using dynamic load-balancing☆12Updated 9 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆31Updated last year
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆60Updated 12 years ago
- An attempt at achieving the theoretical best memory bandwidth of my machine.☆54Updated 12 years ago
- ☆28Updated 8 months ago
- Linux Cross-Memory Attach☆96Updated last year
- A Comprehensive Benchmark Suite for Graph Computing☆70Updated 6 years ago
- Portals is a low-level network API for high-performance networking on high-performance computing systems developed by Sandia National Lab…☆41Updated last year
- A tool for measuring the cache-coherence latencies of a processor (i.e., the latencies of loads, stores, CAS, FAI, TAS, and SWAP).☆79Updated 3 years ago
- Simplified Interface to Complex Memory☆28Updated 2 years ago
- Blaze runtime system that support efficient accelerator integration for big data.☆24Updated 8 years ago
- A framework for pipelined computing on GPU☆30Updated 6 years ago
- A Benchmark Suite for Heterogeneous System Computation☆55Updated 11 months ago
- A Distributed Multi-GPU System for Fast Graph Processing☆65Updated 7 years ago
- Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant"☆23Updated 5 years ago
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆77Updated 5 months ago
- TLB Benchmarks☆35Updated 8 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 8 years ago
- Grappa: scaling irregular applications on commodity clusters☆159Updated 8 years ago
- The SHOC Benchmark Suite☆260Updated 4 months ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar…☆100Updated 6 years ago