andyneff / cuda-waste
CUDA Waste is a wrapper for emulation of CUDA programs on Windows
☆12Updated 8 years ago
Related projects: ⓘ
- Any code related to AMDGPUs☆8Updated 6 years ago
- ROCm - AMDGPU Compute Application Binary Interface☆40Updated 2 years ago
- OpenCL compilation with clang compiler.☆26Updated 3 months ago
- a clone of POCL that includes RISC-V newlib devices support and Vortex☆36Updated 3 months ago
- ROCm's Thunk Interface☆81Updated last month
- ROCm OpenCL Compiler Tool Driver☆24Updated 4 years ago
- A Benchmark Suite for Heterogeneous System Computation☆52Updated last week
- Tools for parsing, assembling, and disassembling HSAIL.☆70Updated 4 years ago
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.☆94Updated 14 years ago
- Source for Demystifying GPU Microarchitecture through Microbenchmarking☆16Updated last year
- OpenCL/SPIR-V implementation of HIP☆104Updated last year
- A source-to-source compiler for automatic parallelization of C programs through code annotation.☆60Updated 4 years ago
- ☆44Updated 7 months ago
- CMake modules used within the ROCm libraries☆59Updated this week
- C for Media Runtime☆23Updated 2 years ago
- MIOpenGEMM is now deprecated☆61Updated last year
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆61Updated this week
- Intel® FPGA Runtime for OpenCL™ Software Technology☆32Updated this week
- Bandwidth test for ROCm☆45Updated this week
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆46Updated 2 weeks ago
- ☆13Updated last year
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated 7 months ago
- CLRadeonExtender (GCN assembler, Radeon assembler) mirror☆95Updated 3 years ago
- HSAIL LLVM Tree - Development has stopped on this branch This was a development branch☆15Updated 8 years ago
- ROCm Driver RDMA Peer to Peer Support☆19Updated 5 years ago
- Compute applications.☆25Updated 4 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance☆17Updated 2 years ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆33Updated 3 months ago
- oneAPI Level Zero Conformance & Performance test content☆45Updated last week
- GPU Optimization and Memory Abstraction Framework☆32Updated 4 years ago