hummingtree / cuda-graph-with-dynamic-parameters
☆14 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for cuda-graph-with-dynamic-parameters
- ☆78 · Updated 6 months ago
- A lightweight, Pythonic, frontend for MLIR ☆79 · Updated last year
- development repository for the open earth compiler ☆77 · Updated 3 years ago
- TPP experimentation on MLIR for linear algebra ☆111 · Updated 2 weeks ago
- ☆50 · Updated 4 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels. ☆123 · Updated last year
- ☆40 · Updated 3 years ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores ☆30 · Updated 3 months ago
- Benchmark for measuring the performance of sparse and irregular memory access. ☆75 · Updated 2 weeks ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite ☆60 · Updated 6 years ago
- A library to benchmark CUDA code, similar to google benchmark. ☆28 · Updated 3 years ago
- An extension library of WMMA API (Tensor Core API) ☆82 · Updated 3 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆99 · Updated 7 years ago
- MLIR-based partitioning system ☆36 · Updated this week
- 🎃 GPU load-balancing library for regular and irregular computations. ☆57 · Updated 4 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆123 · Updated this week
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems ☆127 · Updated 4 years ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API ☆65 · Updated last year
- ☆47 · Updated 2 weeks ago
- Assembler for NVIDIA Volta and Turing GPUs ☆200 · Updated 2 years ago
- Chai ☆42 · Updated 11 months ago
- ☆57 · Updated this week
- ☆44 · Updated 5 years ago
- ☆47 · Updated 5 years ago
- ☆15 · Updated 5 years ago
- ☆41 · Updated 4 years ago
- Official BOLT Repository ☆27 · Updated 2 months ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆77 · Updated 5 years ago