bcarlet / ptx-mathLinks

☆18

Alternatives and similar repositories for ptx-math

Users that are interested in ptx-math are comparing it to the libraries listed below

Sorting:

intel / vc-intrinsics
☆58Updated last month
llvm / eudsl
Embedded Universal DSL: a good DSL for us, by us
☆48Updated this week
michalpaszkowski / LLVM-Canon
LLVM-Canon aims to transform LLVM modules into a canonical form by reordering and renaming instructions while preserving the same semanti…
☆16Updated last year
SunsetQuest / CudaPAD
CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.
☆124Updated 2 years ago
opencompl / mlir-fuzz
A enumerator for MLIR, relying on the information given by IRDL.
☆18Updated last week
ychen306 / vegen
☆31Updated 2 years ago
metalift / metalift
A program synthesis framework for verified lifting applications
☆56Updated 4 months ago
hyqneuron / asfermi
assembler for NVIDIA FERMI. Imported from Google Code
☆73Updated 10 years ago
spcl / haystack
Haystack is an analytical cache model that given a program computes the number of cache misses.
☆46Updated 6 years ago
iml130 / mlir-emitc
Conversions to MLIR EmitC
☆132Updated 10 months ago
ithemal / bhive
☆40Updated 3 years ago
kitbarton / LLVMLoopOptTutorial
Tutorial for LLVM Dev Conference 2019.
☆15Updated 5 years ago
decodecudabinary / Decoding-CUDA-Binary
☆54Updated 5 years ago
hkust-adsl / gass
☆38Updated 3 years ago
sderek / CUDAAdvisor
CUDAAdvisor: a GPU profiling tool
☆50Updated 7 years ago
microsoft / Accera
Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research
☆109Updated 2 years ago
makslevental / mmlir
A minimal (really) out-of-tree MLIR example
☆45Updated 2 months ago
NVlabs / ptxmemorymodel
☆64Updated 6 years ago
revec / VectorBench
Benchmarks for auto-vectorization and revectorization, including both hand-vectorized and scalar code
☆30Updated 6 years ago
SamAinsworth / reproduce-cgo2017-paper
Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.
☆41Updated 4 years ago
ithemal / Ithemal
Instruction THroughput Estimator using MAchine Learning (ITHEMAL)
☆150Updated 3 years ago
benchmark-subsetting / cere
CERE: Codelet Extractor and REplayer
☆40Updated 2 years ago
mc-imperial / gpuverify
GPUVerify: a Verifier for GPU Kernels
☆69Updated 3 years ago
kristerw / smtgcc
Some experiments with SMT solvers and GIMPLE IR
☆77Updated this week
aqjune / mlir-tv
A translation validation framework for MLIR
☆88Updated 7 months ago
MPACT-ORG / mpact-compiler
Retargetable ML compilers for the twenty-first century!
☆13Updated 5 months ago
vortexgpgpu / NVPTX-SPIRV-Translator
The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.
☆43Updated 3 years ago
laanwj / decuda
Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.
☆103Updated 15 years ago
llvm / mlir-www
☆83Updated this week
jaopaulolc / KernelFaRer
KernelFaRer: Replacing Native-Code Idioms with High-Performance Library Calls
☆12Updated last month