HabanaAI / tpc_llvm

TPC-CLANG compiler, which compiles the TPC C programming language used in HabanaLabs Deep-Learning Accelerators

☆26

Alternatives and similar repositories for tpc_llvm:

Users who are interested in tpc_llvm are comparing it to the repositories listed below.

xiuxiazhang / KeplerAs
An Open Source Kepler GPU Assembler
☆20 Updated 7 years ago
cupbop / CuPBoP
A framework that supports executing unmodified CUDA source code on non-NVIDIA devices.
☆111 Updated last week
intel / vc-intrinsics
☆56 Updated last week
NVlabs / ptxmemorymodel
☆48 Updated 5 years ago
ROCm / rocm_bandwidth_test
Bandwidth test for ROCm
☆52 Updated 3 weeks ago
nod-ai / iree-amd-aie
IREE plugin repository for the AMD AIE accelerator
☆71 Updated this week
Zhao-Dongyu / sgemm_riscv
This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platfor…
☆18 Updated last month
kumasento / polymer
Bridging polyhedral analysis tools to the MLIR framework
☆106 Updated last year
gpgpu-sim / gpgpu-sim_simulations
A repository that complements gpgpu-sim, providing automated regression scripts, simulation launching utilities and the code + arguments …
☆68 Updated 4 years ago
ROCm / rocMLIR
☆131 Updated this week
daadaada / gas
☆40 Updated 4 years ago
decodecudabinary / Decoding-CUDA-Binary
☆51 Updated 5 years ago
lanl / PPT
Performance Prediction Toolkit
☆51 Updated 3 weeks ago
HabanaAI / SynapseAI_Core
SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
☆38 Updated 2 years ago
NVlabs / SASSI
Flexible GPGPU instrumentation
☆86 Updated 5 years ago
gpudirect / libgdsync
GPUDirect Async support for IB Verbs
☆91 Updated 2 years ago
spcl / mlir-dace
Data-Centric MLIR dialect
☆39 Updated last year
ChipsandCheese / Microbenchmarks
Trying to figure various CPU things out
☆73 Updated 10 months ago
NodLabs / mlir-examples
A simple end-to-end example of taking an ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]
☆29 Updated 3 years ago
bondhugula / llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…
☆32 Updated last month
ROCm / TransferBench
TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)
☆38 Updated this week
hyqneuron / asfermi
Assembler for NVIDIA Fermi. Imported from Google Code
☆71 Updated 9 years ago
polymage-labs / mlirx
MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com
☆38 Updated last year
aditya4d / gemm-vega64
Implement asm gemm on vega64 for 4096x4096 fp32 matrix
☆21 Updated 5 years ago
mattsinc / heterosync
HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs
☆27 Updated 3 months ago
ROCm / rocSHMEM
rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.
☆48 Updated 2 weeks ago
gpgpu-sim / cutlass-gpgpu-sim
☆23 Updated 5 years ago
karakozov / gpudma
GPUDirect example
☆58 Updated 3 years ago
laanwj / decuda
Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.
☆97 Updated 14 years ago
travisdowns / robsize
ROB size testing utility
☆140 Updated 3 years ago