DLPrimitives/OpenCL out of tree backend for pytorch
☆388Nov 26, 2025Updated 3 months ago
Alternatives and similar repositories for pytorch_dlprim
Users that are interested in pytorch_dlprim are comparing it to the libraries listed below
Sorting:
- Deep Learning Primitives and Mini-Framework for OpenCL☆208Sep 9, 2024Updated last year
- Example of using pytorch's open device registration API☆31Oct 14, 2022Updated 3 years ago
- OpenCL port of TensorFlow using SYCL, generic instructions for building are here:☆61Mar 31, 2020Updated 5 years ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆320Mar 13, 2026Updated last week
- JAX interpreter for Vulkan☆16Jun 1, 2021Updated 4 years ago
- A prototype CUDA-to-OpenCL source-to-source translator, built on the Clang compiler framework☆208Jul 12, 2020Updated 5 years ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆875Apr 23, 2025Updated 10 months ago
- Tuned OpenCL BLAS☆1,169Feb 1, 2026Updated last month
- The Riallto Open Source Project from AMD☆85Apr 10, 2025Updated 11 months ago
- Tensor Tiling Library☆38Sep 23, 2025Updated 5 months ago
- VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.☆904Jan 21, 2024Updated 2 years ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆49Aug 18, 2025Updated 7 months ago
- C API drivers for PYNQ FPGA board☆42Oct 10, 2025Updated 5 months ago
- ☆11Dec 9, 2025Updated 3 months ago
- A pseudo random number generator library written against the SYCL API.☆11Jun 11, 2019Updated 6 years ago
- Easy to run kernels using OpenCL☆187Apr 22, 2025Updated 10 months ago
- Recording models☆11Sep 19, 2023Updated 2 years ago
- CUDA on non-NVIDIA GPUs☆14,009Updated this week
- Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYC…☆464Apr 20, 2025Updated 11 months ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,014Mar 13, 2026Updated last week
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 3 years ago
- OpenCL integration for Python, plus shiny features☆1,130Mar 9, 2026Updated last week
- Implementation of OpenCL 3.0 on Vulkan☆424Mar 2, 2026Updated 2 weeks ago
- A guide to help developers get up and running quickly with the OpenCL programming framework☆678Aug 7, 2024Updated last year
- ☆15Jan 12, 2024Updated 2 years ago
- The OpenCL ICD Loader project.☆295Feb 6, 2026Updated last month
- IREE compiler and runtime for Snitch☆14Oct 9, 2025Updated 5 months ago
- Benchmarking PyTorch 2.0 different models☆20Mar 19, 2023Updated 3 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Mar 12, 2026Updated last week
- A tool which profiles OpenCL devices to find their peak capacities☆483Mar 10, 2026Updated last week
- Simple starter CMake project that uses NVBench.☆16May 6, 2025Updated 10 months ago
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,801Updated this week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,770Mar 13, 2026Updated last week
- Configure NVMe by CLI, and test it with fio!☆16Mar 13, 2026Updated last week
- ☪☮$m✡✝🍏linux, a Linux distribution based on cosmopolitan binaries☆21Dec 27, 2023Updated 2 years ago
- Exocompilation for productive programming of hardware accelerators☆716Mar 13, 2026Updated last week
- ☆21Jan 21, 2026Updated last month
- Get image width and height reading as few bytes as possible.☆18Apr 16, 2022Updated 3 years ago
- A portable GPU/CPU Path Tracer library powered by SYCL. (OpenCL/CUDA/OpenMP)☆16Feb 19, 2019Updated 7 years ago