Xilinx / inference-server
☆43Updated 7 months ago
Alternatives and similar repositories for inference-server:
Users that are interested in inference-server are comparing it to the libraries listed below
- The Riallto Open Source Project from AMD☆71Updated 2 months ago
- ☆58Updated last year
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆134Updated this week
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆115Updated 2 months ago
- Example for running IREE in a bare-metal Arm environment.☆26Updated 2 weeks ago
- oneAPI Data Parallel C++ (DPC++) language reference☆24Updated 2 years ago
- ☆23Updated 11 months ago
- IREE plugin repository for the AMD AIE accelerator☆73Updated this week
- ☆55Updated 2 years ago
- BLAS implementation for Intel FPGA☆76Updated 4 years ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators☆85Updated 3 months ago
- ☆37Updated 2 years ago
- An extension library of WMMA API (Tensor Core API)☆87Updated 6 months ago
- ☆134Updated this week
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 4 months ago
- Data-Centric MLIR dialect☆40Updated last year
- Experiments and prototypes associated with IREE or MLIR☆51Updated 5 months ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆83Updated last year
- An IR for efficiently simulating distributed ML computation.☆25Updated last year
- Intel® SHMEM - Device initiated shared memory based communication library☆22Updated 2 months ago
- GPTPU for SC 2021☆51Updated last year
- Standalone Flash Attention v2 kernel without libtorch dependency☆99Updated 4 months ago
- Bandwidth test for ROCm☆53Updated this week
- MLIR-based partitioning system☆58Updated this week
- a simple end to end example of taking a ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆29Updated 4 years ago
- AI applications and tools☆26Updated this week
- Emulating DMA Engines on GPUs for Performance and Portability☆35Updated 9 years ago
- Stretching GPU performance for GEMMs and tensor contractions.☆231Updated this week
- SYCL Reference Manual☆27Updated 9 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated this week