negativo17 / cuda
NVIDIA Compute Unified Device Architecture Toolkit
☆14Updated last month
Alternatives and similar repositories for cuda:
Users that are interested in cuda are comparing it to the libraries listed below
- Python bindings for libNVVM☆36Updated 10 years ago
- HSAIL LLVM Tree - Development has stopped on this branch This was a development branch☆15Updated 8 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- OpenCL tool to detect buffer overflows in GPU kernels☆21Updated 6 years ago
- ☆14Updated 5 years ago
- probability distributions for OpenCL☆9Updated 7 years ago
- Enable Polyhedral JIT compilation☆9Updated 6 years ago
- Compiler toolkit for neuFlow.☆26Updated 11 years ago
- An ONNX backend using PlaidML☆28Updated 6 years ago
- stage the upgrade of hcc-clang to clang ToT☆11Updated 5 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- DSL for stencils and image processing☆13Updated 8 years ago
- C for Media Runtime☆23Updated 2 years ago
- experimental port of nervana neon kernels in OpenCL☆11Updated 8 years ago
- Custom fork containing our own python backend for integration into neon☆15Updated 2 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute …☆13Updated 3 years ago
- BLAS OpenCL implementation.☆15Updated 9 years ago
- C++ Utility classes☆17Updated 6 years ago
- OpenCL compilation with clang compiler.☆26Updated 7 months ago
- Multiplication using AVX512 and AVX512IFMA instructions☆23Updated 9 years ago
- HSAIL (BRIG) frontend for gcc☆11Updated 6 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆28Updated 5 years ago
- Library to program with streams, events, and to queue own functions into a stream.☆16Updated 6 months ago
- A utility to dump GPU's property☆6Updated 9 years ago
- ☆9Updated 5 years ago
- Ninja-based configuration system☆11Updated 4 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- A framework for building reranking models.☆29Updated 9 years ago
- These drivers have been superseded by ROCm Platform now hosted at Radeon Open Compute GitHub Repo☆61Updated 8 years ago