npinto / python-cuda
Python bindings for CUDA 2.1 with numpy integration
☆25Updated 15 years ago
Related projects ⓘ
Alternatives and complementary repositories for python-cuda
- C++ Summer Lecture Series 2016☆13Updated 8 years ago
- A common set of compute primitives for PyCUDA and PyOpenCL☆58Updated 3 months ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- [deprecated] Reference Implementation of OpenSHMEM on GASNet (specification <= 1.3)☆43Updated 7 years ago
- Project ARES represents a joint effort between LANL and ORNL to introduce a common compiler representation and tool-chain for HPC applica…☆10Updated 7 years ago
- High-level framework for stencil computations☆39Updated 9 years ago
- OpenUH - Open Source UH Compiler☆53Updated 7 years ago
- A Fortran language frontend for LLVM☆25Updated 11 years ago
- Fortran Front-End☆34Updated 2 years ago
- Fortran frontend for LLVM☆22Updated 9 years ago
- A Sound and Complete Verification Tool for Warp-Specialized GPU Kernels☆18Updated 9 years ago
- Generate and execute native code at run time, from Python☆51Updated 3 months ago
- Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications☆21Updated 8 months ago
- A compact hash algorithm for CPUs and GPUs using OpenCL☆14Updated 4 years ago
- Scientific library for high-precision computations and research☆50Updated 7 years ago
- SLURM: A Highly Scalable Resource Manager☆65Updated 7 years ago
- An API to provide an efficient distributed queue on a cluster. Libcircle is currently used in production to quickly traverse and perform …☆98Updated 4 years ago
- ☆58Updated 2 years ago
- C++ library for numerical arrays and tensor objects and operations with them, designed to allow Matlab-style programming.☆51Updated last year
- Library to program with streams, events, and to queue own functions into a stream.☆16Updated 4 months ago
- This repository contains components that will support percolation via OpenCL and CUDA☆32Updated 2 years ago
- Universal Number Library☆40Updated 6 years ago
- The parallel API to be utilized by AllScale projects to express parallelism.☆9Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Checks to verify the usage of the MPI API in C and C++ code, based on Clang’s Static Analyzer and Clang-Tidy.☆39Updated 2 months ago
- Tensor Contraction Code Generator☆36Updated 7 years ago
- ☆15Updated 8 years ago
- Parallel implementation of bzip2 using cuda☆32Updated 13 years ago
- ☆10Updated 9 years ago