☆80Mar 18, 2026Updated this week
Alternatives and similar repositories for torch-xpu-ops
Users who are interested in torch-xpu-ops are comparing it to the libraries listed below.
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆69Updated this week
- OpenAI Triton backend for Intel® GPUs☆236Updated this week
- ☆58Mar 6, 2026Updated 2 weeks ago
- A Python package that extends the official PyTorch to easily obtain better performance on Intel platforms☆2,014Mar 13, 2026Updated last week
- https://bbuf.github.io/gpu-glossary-zh/☆26Nov 7, 2025Updated 4 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆263Mar 17, 2026Updated last week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Mar 11, 2026Updated last week
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated last year
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆107Updated this week
- Helper Files for IDC☆45Oct 23, 2023Updated 2 years ago
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆64Mar 17, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- Collection of small examples for running on ALCF resources☆21Updated this week
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆176Mar 17, 2026Updated last week
- CPU and GPU tutorial examples☆13Apr 4, 2025Updated 11 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Jul 31, 2025Updated 7 months ago
- A tracing infrastructure for heterogeneous computing applications.☆40Updated this week
- VaniDL is a tool for analyzing the I/O patterns and behavior of Deep Learning applications.☆10Jul 8, 2022Updated 3 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- ☆282Updated this week
- Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver☆1,354Updated this week
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 7 months ago
- ☆13Aug 28, 2025Updated 6 months ago
- ☆20Jan 29, 2026Updated last month
- An experimental CPU backend for Triton (https://github.com/openai/triton)☆49Aug 18, 2025Updated 7 months ago
- Yaksa: High-performance Noncontiguous Data Management☆16Oct 1, 2025Updated 5 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆148Updated this week
- This is the repository for an I/O benchmark that represents scientific deep learning workloads.☆23Dec 6, 2022Updated 3 years ago
- ☆24Oct 9, 2025Updated 5 months ago
- 🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza…☆914Updated this week
- Adapt IPEX to CUDA☆41Jan 10, 2026Updated 2 months ago
- ☆128Updated this week
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…☆730Feb 11, 2026Updated last month
- The Intel® Automated Self-Checkout Reference Package provides critical components required to build and deploy a self-checkout use case u…☆32Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆207Mar 16, 2026Updated last week
- Computation using data flow graphs for scalable machine learning☆68Updated this week
- Implement Flash Attention using Cute.☆102Dec 17, 2024Updated last year
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated 2 years ago
- PyTorch centric eager mode debugger☆48Dec 16, 2024Updated last year