DanieleDeSensi / mammut
MAchine Micro Management UTilities
☆11Updated 3 years ago
Related projects: ⓘ
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications"☆11Updated 8 years ago
- ☆25Updated 2 years ago
- OpenMP-based parallel program for counting the number of triangles in a sparse graph☆16Updated 6 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆33Updated 4 years ago
- A CUDA-based multi-GPU vertex-centric graph processing framework based on Warp Segmentation and Vertex Refinement techniques.☆10Updated 7 years ago
- Simplified Interface to Complex Memory☆26Updated last year
- Intel Heterogeneous Research Compiler (iHRC)☆25Updated last year
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- The SparseX sparse kernel optimization library☆39Updated 5 years ago
- Nitro Autotuning Framework☆9Updated 8 years ago
- Global Memory and Threading runtime system☆23Updated 4 months ago
- GraphMat graph analytics framework☆99Updated last year
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 6 years ago
- ☆11Updated this week
- Enterprise: Breadth-First Graph Traversal on GPUs. SC'15.☆30Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- Heterogeneous Active Messages C++ library☆21Updated 4 years ago
- An implementation of ARMCI using MPI one-sided communication (RMA)☆12Updated last month
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 6 years ago
- ☆39Updated 6 years ago
- NAS Parallel Benchmarks for GPU☆18Updated last month
- Webgraph++ code (http://cnets.indiana.edu/groups/nan/webgraph/)☆30Updated last month
- Mimir is a new implementation of MapReduce over MPI. Mimir inherits the core principles of existing MapReduce frameworks, such as MR-MPI,…☆21Updated 5 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs☆14Updated 9 years ago
- A Benchmark Suite for Heterogeneous System Computation☆52Updated last week
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆12Updated 3 years ago
- Enable Polyhedral JIT compilation☆9Updated 6 years ago
- Artifact for PPoPP 2018 paper "Making Pull-Based Graph Processing Performant"☆23Updated 4 years ago
- MPI+OpenMP implementation of Louvain method for Graph Community Detection, with a number of parallel heuristics/approximate computing tec…☆25Updated 11 months ago
- GPU Optimization and Memory Abstraction Framework☆32Updated 4 years ago
- High-Performance Streaming Graph Analytics on GPUs☆31Updated 5 years ago