StanfordLegion / legion
The Legion Parallel Programming System
☆723Updated last month
Alternatives and similar repositories for legion
Users that are interested in legion are comparing it to the libraries listed below
Sorting:
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,294Updated 3 weeks ago
- This is a set of simple programs that can be used to explore the features of a parallel platform.☆431Updated 3 weeks ago
- GraphIt - A High-Performance Domain Specific Language for Graph Analytics☆378Updated 2 years ago
- Programmable CUDA/C++ GPU Graph Analytics☆1,021Updated 9 months ago
- RAJA Performance Portability Layer (C++)☆516Updated this week
- The Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.☆214Updated this week
- A code generator for array-based code on CPUs and GPUs☆602Updated 2 weeks ago
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆533Updated 2 months ago
- SuiteSparse graph algorithms in the language of linear algebra. For production: (default) STABLE branch. Code development: ask me for t…☆373Updated this week
- Caliper is an instrumentation and performance profiling library☆372Updated last week
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆418Updated last month
- ☆537Updated this week
- HPCToolkit performance tools: measurement and analysis components☆341Updated 2 months ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆869Updated last week
- Distributed-memory, arbitrary-precision, dense and sparse-direct linear algebra, conic optimization, and lattice reduction☆510Updated 5 years ago
- High-performance automatic differentiation of LLVM and MLIR.☆1,387Updated this week
- CUSP : A C++ Templated Sparse Matrix Library☆413Updated 6 months ago
- High-Performance Linear Algebra-based Graph Primitives on GPUs☆222Updated 3 years ago
- ☆134Updated last year
- Livermore Big Artificial Neural Network Toolkit☆228Updated last month
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆205Updated last week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆108Updated last year
- A massively-parallel, block-sparse tensor framework written in C++☆287Updated last week
- STREAM, for lots of devices written in many programming models☆335Updated 8 months ago
- Assembler for NVIDIA Maxwell architecture☆996Updated 2 years ago
- The Lift programming language and compiler☆212Updated 3 years ago
- DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science☆158Updated 3 years ago
- Galois: C++ library for multi-core and multi-node parallelization☆325Updated 11 months ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆336Updated this week
- An application-focused API for memory management on NUMA & GPU architectures☆356Updated last week