☆89Apr 28, 2026Updated this week
Alternatives and similar repositories for torch-xpu-ops
Users that are interested in torch-xpu-ops are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆72Updated this week
- OpenAI Triton backend for Intel® GPUs☆249Updated this week
- KFunca: A minimalist, high-performance GPU-based automatic differentiation framework☆30Aug 14, 2025Updated 8 months ago
- ☆60Mar 6, 2026Updated last month
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,011Mar 30, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- oneAPI - Data Parallel C++ course for students☆44Nov 4, 2024Updated last year
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆266Apr 27, 2026Updated last week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Mar 11, 2026Updated last month
- The repository contains a reference end-to-end pipeline for a real-time video analytics application. Realtime data is provided to an infe…☆12Nov 3, 2025Updated 6 months ago
- Helper Files for IDC☆45Oct 23, 2023Updated 2 years ago
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆72Apr 27, 2026Updated last week
- Collection of small examples for running on ALCF resources☆21Apr 28, 2026Updated last week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Jul 31, 2025Updated 9 months ago
- VaniDL is an tool for analyzing I/O patterns and behavior with Deep Learning Applications.☆10Jul 8, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A tracing infrastructure for heterogeneous computing applications.☆41Updated this week
- ☆701Updated this week
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver☆1,377Updated this week
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 8 months ago
- ☆23Mar 16, 2026Updated last month
- ☆12Apr 24, 2025Updated last year
- ☆19Apr 24, 2026Updated last week
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆49Aug 18, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Next generation ODE translator☆13Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆150Apr 23, 2026Updated last week
- This is repository for a I/O benchmark which represents Scientific Deep Learning Workloads.☆23Dec 6, 2022Updated 3 years ago
- ☆24Oct 9, 2025Updated 6 months ago
- Mini-Engine Demonstration of Combining XeSS with VRS Tier 2.☆14Jan 26, 2026Updated 3 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆29Apr 24, 2026Updated last week
- ☆20Apr 24, 2026Updated last week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆209Updated this week
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Computation using data flow graphs for scalable machine learning☆68Updated this week
- ☆32Jul 2, 2025Updated 10 months ago
- The Intel® Automated Self-Checkout Reference Package provides critical components required to build and deploy a self-checkout use case u…☆32Apr 24, 2026Updated last week
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated 2 years ago
- Implement Flash Attention using Cute.☆106Dec 17, 2024Updated last year
- A lattice QCD library.☆16Apr 27, 2026Updated last week
- PyTorch centric eager mode debugger☆48Dec 16, 2024Updated last year