Xilinx / inference-server
☆47 · Updated last year
Alternatives and similar repositories for inference-server
Users who are interested in inference-server compare it to the libraries listed below.
- The Riallto Open Source Project from AMD ☆84 · Updated 5 months ago
- AI applications and tools ☆29 · Updated last month
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆161 · Updated this week
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM ☆121 · Updated 11 months ago
- IREE plugin repository for the AMD AIE accelerator ☆107 · Updated last week
- Fork of LLVM to support AMD AIEngine processors ☆167 · Updated this week
- Example for running IREE in a bare-metal Arm environment. ☆40 · Updated 2 months ago
- ☆60 · Updated 2 years ago
- ☆37 · Updated 3 years ago
- Fork of upstream onnxruntime focused on supporting RISC-V accelerators ☆87 · Updated 2 years ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆136 · Updated 9 months ago
- ☆55 · Updated 2 years ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators ☆97 · Updated 3 months ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi ☆43 · Updated 8 months ago
- npcomp - An aspirational MLIR-based numpy compiler ☆51 · Updated 5 years ago
- GPTPU for SC 2021 ☆52 · Updated 2 years ago
- Library for modelling performance costs of different Neural Network workloads on NPU devices ☆35 · Updated 2 weeks ago
- Poplar libraries ☆121 · Updated last year
- Conversions to MLIR EmitC ☆133 · Updated 9 months ago
- Intel® FPGA Runtime for OpenCL™ Software Technology ☆32 · Updated 7 months ago
- A simple end-to-end example of taking an ML graph (TF2 / PyTorch) and running it on a device [cpu, gpu] ☆35 · Updated 4 years ago
- BLAS implementation for Intel FPGA ☆77 · Updated 4 years ago
- ☆107 · Updated this week
- User-Mode Driver for Tenstorrent hardware ☆33 · Updated this week
- Emulating DMA Engines on GPUs for Performance and Portability ☆41 · Updated 10 years ago
- AMD's graph optimization engine. ☆253 · Updated this week
- ☆19 · Updated last week
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com ☆38 · Updated last year
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation. ☆25 · Updated 3 years ago
- Buda Compiler Backend for Tenstorrent devices ☆30 · Updated 6 months ago