othmanemdi / kitea
☆9Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for kitea
- ☆11Updated 2 years ago
- ☆8Updated 2 years ago
- Powerful automatic differentiation in C++ and Python☆256Updated last month
- FastAD is a C++ implementation of automatic differentiation both forward and reverse mode.☆103Updated last year
- Structured Matrix Package (LBNL)☆167Updated this week
- Performance-portable geometric search library☆182Updated this week
- Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem☆294Updated 2 months ago
- A C++17 message passing library based on MPI☆167Updated 9 months ago
- RAJA Performance Portability Layer (C++)☆486Updated this week
- A streamlined CMake build system foundation for developing HPC software☆260Updated last week
- HPC solver for nonlinear optimization problems☆210Updated this week
- Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.☆148Updated this week
- Distributed memory, MPI based SuperLU☆188Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆310Updated this week
- Numerical linear algebra software package☆406Updated this week
- clad -- automatic differentiation for C/C++☆288Updated this week
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆57Updated last week
- C++ Template Linear Algebra PACKage☆41Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆112Updated 2 months ago
- Reference Implementation for stdBLAS☆128Updated last week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆44Updated 3 weeks ago
- MWE for using the Eigen library in CUDA kernels☆117Updated 2 years ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm☆192Updated this week
- An implementation of the revised simplex algorithm in CUDA for solving linear optimization problems in the form max{c*x | A*x=b, l<=x<=u}☆27Updated 7 years ago
- An implementation of BLAS using the SYCL open standard.☆259Updated last week
- Library of GPU-resident linear solvers☆57Updated last week
- Sample configuration files for using oneAPI in CI systems☆92Updated this week
- Shroud: generate Fortran and Python wrappers for C and C++ libraries☆90Updated this week
- CS infrastructure components for HPC applications☆157Updated this week
- Run a parallel command inside a split tmux window☆136Updated 2 years ago