bcarlet / ptx-math
☆18 Updated 2 years ago
Alternatives and similar repositories for ptx-math
Users that are interested in ptx-math are comparing it to the libraries listed below
- ☆58 Updated last month
- Embedded Universal DSL: a good DSL for us, by us ☆48 Updated this week
- LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti… ☆16 Updated last year
- CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly. ☆124 Updated 2 years ago
- An enumerator for MLIR, relying on the information given by IRDL. ☆18 Updated last week
- ☆31 Updated 2 years ago
- A program synthesis framework for verified lifting applications ☆56 Updated 4 months ago
- assembler for NVIDIA FERMI. Imported from Google Code ☆73 Updated 10 years ago
- Haystack is an analytical cache model that, given a program, computes the number of cache misses. ☆46 Updated 6 years ago
- Conversions to MLIR EmitC ☆132 Updated 10 months ago
- ☆40 Updated 3 years ago
- Tutorial for LLVM Dev Conference 2019. ☆15 Updated 5 years ago
- ☆54 Updated 5 years ago
- ☆38 Updated 3 years ago
- CUDAAdvisor: a GPU profiling tool ☆50 Updated 7 years ago
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research ☆109 Updated 2 years ago
- A minimal (really) out-of-tree MLIR example ☆45 Updated 2 months ago
- ☆64 Updated 6 years ago
- Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code ☆30 Updated 6 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK. ☆41 Updated 4 years ago
- Instruction THroughput Estimator using MAchine Learning (ITHEMAL) ☆150 Updated 3 years ago
- CERE: Codelet Extractor and REplayer ☆40 Updated 2 years ago
- GPUVerify: a Verifier for GPU Kernels ☆69 Updated 3 years ago
- Some experiments with SMT solvers and GIMPLE IR ☆77 Updated this week
- A translation validation framework for MLIR ☆88 Updated 7 months ago
- Retargetable ML compilers for the twenty-first century! ☆13 Updated 5 months ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆43 Updated 3 years ago
- Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs. ☆103 Updated 15 years ago
- ☆83 Updated this week
- KernelFaRer: Replacing Native-Code Idioms with High-Performance Library Calls ☆12 Updated last month