Xilinx / inference-server
☆43 Updated 8 months ago
Alternatives and similar repositories for inference-server:
Users who are interested in inference-server are comparing it to the libraries listed below:
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆138 Updated 2 weeks ago
- The Riallto Open Source Project from AMD ☆74 Updated 4 months ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators ☆87 Updated 4 months ago
- ☆37 Updated 2 years ago
- Fork of upstream onnxruntime focused on supporting RISC-V accelerators ☆83 Updated last year
- Example for running IREE in a bare-metal Arm environment. ☆31 Updated 2 weeks ago
- IREE plugin repository for the AMD AIE accelerator ☆83 Updated this week
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen) ☆34 Updated 5 months ago
- ☆137 Updated this week
- OpenAI Triton backend for Intel® GPUs ☆168 Updated this week
- HeteroCL-MLIR dialect for accelerator design ☆41 Updated 5 months ago
- ☆59 Updated last year
- rocWMMA ☆102 Updated this week
- npcomp - An aspirational MLIR based numpy compiler ☆51 Updated 4 years ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆115 Updated 2 months ago
- Data-Centric MLIR dialect ☆40 Updated last year
- ☆90 Updated this week
- Bandwidth test for ROCm ☆54 Updated this week
- a clone of POCL that includes RISC-V newlib devices support and Vortex ☆40 Updated last month
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM ☆115 Updated 4 months ago
- Experiments and prototypes associated with IREE or MLIR ☆50 Updated 7 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs ☆79 Updated this week
- MLIR-based partitioning system ☆71 Updated this week
- This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platfor… ☆19 Updated 3 months ago
- Provides Spatial with front-end support from popular machine learning frameworks ☆33 Updated 5 years ago
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆39 Updated 10 months ago
- ☆13 Updated this week
- Conversions to MLIR EmitC ☆127 Updated 3 months ago
- Optimize tensor programs fast with Felix, a gradient descent autotuner. ☆24 Updated 10 months ago
- a simple end-to-end example of taking an ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu] ☆33 Updated 4 years ago