artyom-beilis/pytorch_dlprim

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/artyom-beilis/pytorch_dlprim)

artyom-beilis / pytorch_dlprim

DLPrimitives/OpenCL out of tree backend for pytorch

☆399

Alternatives and similar repositories for pytorch_dlprim

Users that are interested in pytorch_dlprim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

artyom-beilis / dlprimitives
View on GitHub
Deep Learning Primitives and Mini-Framework for OpenCL
☆211Sep 9, 2024Updated last year
bdhirsh / pytorch_open_registration_example
View on GitHub
Example of using pytorch's open device registration API
☆31Oct 14, 2022Updated 3 years ago
codeplaysoftware / tensorflow
View on GitHub
OpenCL port of TensorFlow using SYCL, generic instructions for building are here:
☆62Mar 31, 2020Updated 6 years ago
CHIP-SPV / chipStar
View on GitHub
chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.
☆364Updated this week
Rayfxl / mlir-zh
View on GitHub
MLIR 中文文档
☆22Dec 1, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
vtsynergy / CU2CL
View on GitHub
A prototype CUDA-to-OpenCL source-to-source translator, built on the Clang compiler framework
☆209Jul 12, 2020Updated 6 years ago
hughperkins / coriander
View on GitHub
Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices
☆877Apr 23, 2025Updated last year
CNugteren / CLBlast
View on GitHub
Tuned OpenCL BLAS
☆1,186Apr 13, 2026Updated 3 months ago
alexander-g / vkJAX
View on GitHub
JAX interpreter for Vulkan
☆17Jun 1, 2021Updated 5 years ago
KhronosGroup / OpenCL-TTL
View on GitHub
Tensor Tiling Library
☆42Sep 23, 2025Updated 9 months ago
GoogleBot42 / Tracer
View on GitHub
A portable GPU/CPU Path Tracer library powered by SYCL. (OpenCL/CUDA/OpenMP)
☆16Feb 19, 2019Updated 7 years ago
TNTwise / Universal-NCNN-Upscaler
View on GitHub
☆19Sep 10, 2024Updated last year
albanD / pytorch_dev_env_setup
View on GitHub
☆11Updated this week
Shedou / Neuro
View on GitHub
Useful assemblies of neural network software.
☆14Oct 17, 2025Updated 9 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hughperkins / EasyCL
View on GitHub
Easy to run kernels using OpenCL
☆188Apr 22, 2025Updated last year
EdVince / model_zoo
View on GitHub
Recording models
☆12Sep 19, 2023Updated 2 years ago
Vogtinator / firebird
View on GitHub
Community emulator for TI nspire handhelds
☆12Jun 25, 2026Updated 3 weeks ago
vosen / ZLUDA
View on GitHub
CUDA on non-NVIDIA GPUs
☆14,630Updated this week
apuaaChen / EVT_AE
View on GitHub
Artifacts of EVT ASPLOS'24
☆29Mar 6, 2024Updated 2 years ago
microsoft / antares
View on GitHub
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYC…
☆464Apr 20, 2025Updated last year
intel / intel-extension-for-pytorch
View on GitHub
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
☆2,014Mar 30, 2026Updated 3 months ago
kpet / clvk
View on GitHub
Implementation of OpenCL 3.0 on Vulkan
☆442Updated this week
nsping13 / GAN-Steerability-without-optimization
View on GitHub
☆15Jan 12, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
KhronosGroup / OpenCL-Guide
View on GitHub
A guide to help developers get up and running quickly with the OpenCL programming framework
☆699Aug 7, 2024Updated last year
khaki3 / ptxas-wrapper
View on GitHub
A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code
☆16Mar 19, 2023Updated 3 years ago
hughperkins / DeepCL
View on GitHub
OpenCL library to train deep convolutional neural networks
☆881Jan 5, 2018Updated 8 years ago
NVIDIA / nvbench_demo
View on GitHub
Simple starter CMake project that uses NVBench.
☆15May 6, 2025Updated last year
krrishnarraj / clpeak
View on GitHub
A synthetic micro-benchmark that measures peak compute, bandwidth, and matrix throughput of GPUs and CPUs
☆505Updated this week
SamsungDS / unvme-cli
View on GitHub
Configure NVMe by CLI, and test it with fio!
☆17Updated this week
llvm / torch-mlir
View on GitHub
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
☆1,871Updated this week
exo-lang / exo
View on GitHub
Exocompilation for productive programming of hardware accelerators
☆736Jul 3, 2026Updated 2 weeks ago
AdaptiveCpp / AdaptiveCpp
View on GitHub
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …
☆1,910Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
intel / tiny-tensor-compiler
View on GitHub
☆21Jan 21, 2026Updated 6 months ago
ymd-h / vulkpy
View on GitHub
GPGPU array on Vulkan
☆17Jun 3, 2023Updated 3 years ago
KomputeProject / kompute
View on GitHub
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …
☆2,541Updated this week
ftynse / clint
View on GitHub
Chunky Loop Interaction
☆25Aug 13, 2019Updated 6 years ago
JoshMcguigan / nerve
View on GitHub
☆17Mar 2, 2020Updated 6 years ago
Blaok / fpga-runtime
View on GitHub
☆13Aug 1, 2024Updated last year
ExaWorks / SDK
View on GitHub
ExaWorks SDK
☆11Feb 1, 2024Updated 2 years ago