gevtushenko / cuda_jitLinks
โ16Updated 4 years ago
Alternatives and similar repositories for cuda_jit
Users that are interested in cuda_jit are comparing it to the libraries listed below
Sorting:
- โ58Updated last week
- ๐ GPU load-balancing library for regular and irregular computations.โ62Updated last year
- โ23Updated 3 years ago
- Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast!โ103Updated 2 weeks ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.โ55Updated 4 months ago
- โ70Updated 5 years ago
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.โ119Updated 2 years ago
- development repository for the open earth compilerโ80Updated 4 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repoโ119Updated this week
- Conversions to MLIR EmitCโ128Updated 7 months ago
- RV: A Unified Region Vectorizer for LLVMโ111Updated 2 months ago
- Little OpenMP Libraryโ163Updated 2 years ago
- โ52Updated 5 years ago
- Intelยฎ Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.โ138Updated this week
- A library to benchmark CUDA code, similar to google benchmark.โ29Updated 4 years ago
- Advanced Profiling and Analytics for AMD Hardwareโ161Updated this week
- Generate simple index ranges in C++ and CUDA C++โ39Updated 2 years ago
- portDNN is a library implementing neural network algorithms written using SYCLโ113Updated last year
- โ30Updated 2 years ago
- โ64Updated 6 years ago
- TPP experimentation on MLIR for linear algebraโ133Updated last week
- Benchmark for measuring the performance of sparse and irregular memory access.โ78Updated 3 months ago
- A task benchmarkโ43Updated last year
- A unified framework across multiple programming platformsโ41Updated 2 months ago
- Official BOLT Repositoryโ30Updated 11 months ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.โ134Updated last year
- SYCL Reference Manualโ28Updated last year
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.โ66Updated 5 years ago
- Kernel Tuning Toolkitโ62Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repoโ84Updated this week