DLPrimitives/OpenCL out of tree backend for pytorch
☆396Nov 26, 2025Updated 7 months ago
Alternatives and similar repositories for pytorch_dlprim
Users that are interested in pytorch_dlprim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Learning Primitives and Mini-Framework for OpenCL☆211Sep 9, 2024Updated last year
- Example of using pytorch's open device registration API☆31Oct 14, 2022Updated 3 years ago
- OpenCL port of TensorFlow using SYCL, generic instructions for building are here:☆61Mar 31, 2020Updated 6 years ago
- JAX interpreter for Vulkan☆17Jun 1, 2021Updated 5 years ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆355Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An OpenCL backend for torch.☆302Nov 16, 2016Updated 9 years ago
- A prototype CUDA-to-OpenCL source-to-source translator, built on the Clang compiler framework☆209Jul 12, 2020Updated 5 years ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆877Apr 23, 2025Updated last year
- Tuned OpenCL BLAS☆1,183Apr 13, 2026Updated 2 months ago
- Sample code for matrix transposition in Vulkan☆15Sep 19, 2022Updated 3 years ago
- The Riallto Open Source Project from AMD☆86Apr 10, 2025Updated last year
- Tensor Tiling Library☆42Sep 23, 2025Updated 9 months ago
- VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.☆914Jan 21, 2024Updated 2 years ago
- ☆11Jun 15, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A pseudo random number generator library written against the SYCL API.☆11Jun 11, 2019Updated 7 years ago
- Easy to run kernels using OpenCL☆188Apr 22, 2025Updated last year
- buildroot fork from damien -- RV32 no MMU Linux. Run "make qemu_riscv32_nommu_virt_minimal_defconfig" then "make"☆26Apr 23, 2024Updated 2 years ago
- CUDA on non-NVIDIA GPUs☆14,328Updated this week
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYC…☆463Apr 20, 2025Updated last year
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,015Mar 30, 2026Updated 2 months ago
- OpenCL integration for Python, plus shiny features☆1,141Updated this week
- Implementation of OpenCL 3.0 on Vulkan☆441Jun 22, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆15Jan 12, 2024Updated 2 years ago
- Public effort to document PCI-E Device support for Rockchip based (Single Board) Computers☆21Nov 6, 2025Updated 7 months ago
- The OpenCL ICD Loader project.☆299Jun 16, 2026Updated last week
- IREE compiler and runtime for Snitch☆15May 14, 2026Updated last month
- OpenCL library to train deep convolutional neural networks☆881Jan 5, 2018Updated 8 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- Benchmarking PyTorch 2.0 different models☆20Mar 19, 2023Updated 3 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- A synthetic micro-benchmark that measures peak compute, bandwidth, and matrix throughput of GPUs and CPUs☆500Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Simple starter CMake project that uses NVBench.☆15May 6, 2025Updated last year
- Various Arduino sketches☆15Jan 24, 2017Updated 9 years ago
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,887Updated this week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,850Jun 22, 2026Updated last week
- Exocompilation for productive programming of hardware accelerators☆733May 16, 2026Updated last month
- A portable GPU/CPU Path Tracer library powered by SYCL. (OpenCL/CUDA/OpenMP)☆16Feb 19, 2019Updated 7 years ago
- Yet another machine learning framework☆15Dec 14, 2018Updated 7 years ago