sx-aurora-dev / llvm-project
llvm-project cloned from https://github.com/llvm/llvm-project and modified for VE
☆19Updated 3 weeks ago
Alternatives and similar repositories for llvm-project:
Users that are interested in llvm-project are comparing it to the libraries listed below
- This is the git repository for RIKEN simulator designed to simulate the binary code for Fujitsu A64FX.☆36Updated 4 years ago
- VEDA (VE Driver API)☆17Updated 2 months ago
- Another|Alternative|Awesome VE Offloading stack using ve-urpc☆14Updated last year
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 4 years ago
- World championship code for Graph500☆25Updated last year
- ☆51Updated 4 years ago
- Data Dependence Analyzer in the Polyhedral Model☆20Updated last year
- Library to plot integer sets and maps☆49Updated 8 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆75Updated 2 weeks ago
- instruction-bench☆36Updated 2 years ago
- Artifact for 'Register Optimizations for Stencils on GPUs'☆10Updated 6 years ago
- ☆21Updated 3 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆24Updated 5 years ago
- Polyhedral Parallel Code Generation (source repository: http://repo.or.cz/ppcg.git)☆125Updated 2 years ago
- A Specification and a Library for Data Exchange in Polyhedral Compilation Tools☆29Updated 9 months ago
- Library of High Precision Sparse Matrix Operations Accelerated by SIMD☆42Updated 3 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆111Updated this week
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 11 months ago
- Conversions to MLIR EmitC☆128Updated 4 months ago
- Tutorials for ARM SVE on Docker☆43Updated 2 years ago
- Python wrapper for isl, an integer set library☆77Updated last week
- A SYCL Implementation for CPU and SX-Aurora TSUBASA☆52Updated 2 years ago
- A GPU cache model for research purposes☆28Updated 11 years ago
- BLAS implementation for Intel FPGA☆78Updated 4 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆130Updated last year
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 3 months ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 4 years ago
- Intel® GPU Compute Samples☆106Updated 2 weeks ago
- a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆34Updated 4 years ago
- ☆55Updated 2 years ago