Xilinx / inference-server
☆45Updated 10 months ago
Alternatives and similar repositories for inference-server:
Users who are interested in inference-server are comparing it to the libraries listed below.
- The Riallto Open Source Project from AMD☆77Updated 3 weeks ago
- Example for running IREE in a bare-metal Arm environment.☆33Updated 2 months ago
- AI applications and tools☆27Updated last week
- IREE plugin repository for the AMD AIE accelerator☆93Updated this week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆147Updated this week
- Experiments and prototypes associated with IREE or MLIR☆50Updated 9 months ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)☆34Updated 2 weeks ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- ☆60Updated last year
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices.☆126Updated 4 months ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators☆89Updated last month
- Intel® SHMEM - Device initiated shared memory based communication library☆23Updated last month
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated last year
- End-to-end steps for adding custom ops in PyTorch.☆22Updated 4 years ago
- Data-Centric MLIR dialect☆41Updated last year
- BLAS implementation for Intel FPGA☆78Updated 4 years ago
- A simple end-to-end example of taking an ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu]☆34Updated 4 years ago
- oneAPI Data Parallel C++ (DPC++) language reference☆26Updated 2 years ago
- ☆95Updated this week
- MLIR-based partitioning system☆82Updated this week
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆86Updated 2 years ago
- HeteroCL-MLIR dialect for accelerator design☆40Updated 7 months ago
- A Data-Centric Compiler for Machine Learning☆82Updated last year
- Fork of LLVM to support AMD AIEngine processors☆139Updated this week
- This project contains a code generator that produces static C NN inference deployment code targeting tiny micro-controllers (TinyML) as r…☆29Updated 3 years ago
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆40Updated last month
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated last week
- Conversions to MLIR EmitC☆128Updated 4 months ago
- ☆142Updated this week
- ☆46Updated 2 weeks ago