intel/torch-xpu-ops

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/intel/torch-xpu-ops)

intel / torch-xpu-ops

☆99

Alternatives and similar repositories for torch-xpu-ops

Users that are interested in torch-xpu-ops are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

intel / sycl-tla
View on GitHub
SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs
☆77Updated this week
intel / intel-xpu-backend-for-triton
View on GitHub
OpenAI Triton backend for Intel® GPUs
☆261Updated this week
intel / intel-extension-for-openxla
View on GitHub
☆61Mar 6, 2026Updated 4 months ago
xytpai / kfunca
View on GitHub
KFunca: A minimalist, high-performance GPU-based automatic differentiation framework
☆31Aug 14, 2025Updated 11 months ago
intel / onnxruntime
View on GitHub
ONNX Runtime: cross-platform, high performance scoring engine for ML models
☆88Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IntelLabs / Xe-Forge
View on GitHub
Multi-stage LLM agent pipeline for optimizing Triton kernels on Intel XPU — from analysis to autotuning.
☆16Updated this week
intel / intel-extension-for-pytorch
View on GitHub
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
☆2,014Mar 30, 2026Updated 3 months ago
HabanaAI / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆14Jan 8, 2026Updated 6 months ago
intel / intel-extension-for-deepspeed
View on GitHub
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…
☆65May 27, 2026Updated last month
intel / ai-containers
View on GitHub
This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …
☆79May 27, 2026Updated last month
intel / intel-graphics-compiler
View on GitHub
☆710Updated this week
enp1s0 / cuMpSGEMM
View on GitHub
Fast SGEMM emulation on Tensor Cores
☆17Feb 16, 2025Updated last year
bjodom / idc
View on GitHub
Helper Files for IDC
☆45Oct 23, 2023Updated 2 years ago
HabanaAI / hccl_demo
View on GitHub
☆26Oct 9, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Jul 14, 2026Updated last week
pengzhao-intel / oneAPI_course
View on GitHub
oneAPI - Data Parallel C++ course for students
☆43Nov 4, 2024Updated last year
intel / compute-runtime
View on GitHub
Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
☆1,420Updated this week
HabanaAI / vllm-fork
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆90Jul 13, 2026Updated last week
intel / AI-Playground
View on GitHub
AI PC starter app for doing AI image creation, image stylizing, and chatbot on a PC powered by an Intel® Arc™ GPU.
☆939Updated this week
ROCm / hipDNN
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆57May 28, 2026Updated last month
huggingface / optimum-habana
View on GitHub
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
☆212Jul 6, 2026Updated 2 weeks ago
libxsmm / tpp-pytorch-extension
View on GitHub
Intel® Tensor Processing Primitives extension for Pytorch*
☆19Jul 4, 2026Updated 2 weeks ago
intel / llm-scaler
View on GitHub
☆430Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
intel / level-zero-npu-extensions
View on GitHub
☆17Jul 15, 2026Updated last week
intel / auto-round
View on GitHub
A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support…
☆1,534Updated this week
intel / tiny-dpcpp-nn
View on GitHub
SYCL implementation of Fused MLPs for Intel GPUs
☆51Jul 17, 2026Updated last week
intel / mlir-extensions
View on GitHub
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆153Updated this week
pytorch / test-infra
View on GitHub
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …
☆110Updated this week
argonne-lcf / Megatron-DeepSpeed
View on GitHub
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆17Mar 11, 2026Updated 4 months ago
n-eiling / cuda-fatbin-decompression
View on GitHub
☆24Jun 12, 2023Updated 3 years ago
intel / neural-compressor
View on GitHub
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …
☆2,684Updated this week
oneapi-src / SYCLomatic
View on GitHub
☆290Updated this week
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
HabanaAI / gaudi-pytorch-bridge
View on GitHub
☆18Jul 13, 2026Updated last week
HabanaAI / SynapseAI_Core
View on GitHub
SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
☆46Feb 3, 2025Updated last year
HabanaAI / vllm-hpu-extension
View on GitHub
☆16Jun 4, 2026Updated last month
Pranavchiku / Gesture-Detection-Application
View on GitHub
Developing multi platform gesture detector application by applying concepts learnt in Embedded Systems course on peripheral devices.
☆21Dec 8, 2023Updated 2 years ago
intel / tiny-tensor-compiler
View on GitHub
☆21Jan 21, 2026Updated 6 months ago
intel / llvm
View on GitHub
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
☆1,511Updated this week
HabanaAI / Model-References
View on GitHub
Reference models for Intel(R) Gaudi(R) AI Accelerator
☆172Jan 8, 2026Updated 6 months ago