intel/xetla

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/intel/xetla)

intel / xetla

☆61

Alternatives and similar repositories for xetla

Users that are interested in xetla are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

intel / intel-xpu-backend-for-triton
View on GitHub
OpenAI Triton backend for Intel® GPUs
☆258Updated this week
intel / sycl-tla
View on GitHub
SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs
☆76Updated this week
intel / intel-extension-for-deepspeed
View on GitHub
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…
☆65May 27, 2026Updated last month
libxsmm / tpp-pytorch-extension
View on GitHub
Intel® Tensor Processing Primitives extension for Pytorch*
☆19Jul 4, 2026Updated 2 weeks ago
zjin-lcf / Rodinia_SYCL
View on GitHub
☆15Oct 20, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
intel / mlir-extensions
View on GitHub
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆153Updated this week
intel / tiny-dpcpp-nn
View on GitHub
SYCL implementation of Fused MLPs for Intel GPUs
☆51Updated this week
intel / cm-compiler
View on GitHub
☆154Jul 14, 2026Updated last week
intel / xpumanager
View on GitHub
☆180Updated this week
intel / intel-extension-for-tensorflow
View on GitHub
Intel® Extension for TensorFlow*
☆355Oct 29, 2025Updated 8 months ago
sammysun0711 / ov_llm_bench
View on GitHub
OpenVINO LLM Benchmark
☆11Dec 7, 2023Updated 2 years ago
oneapi-src / level-zero-spec
View on GitHub
☆19Jun 26, 2026Updated 3 weeks ago
intel / metrics-library
View on GitHub
☆19Jun 22, 2026Updated 3 weeks ago
deskvox / deskvox
View on GitHub
DeskVOX is a real-time visualization tool for 3D data sets like image stacks from CT or MRI scanners, or confocal microscopes. It has an …
☆21Jun 3, 2026Updated last month
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
uxlfoundation / oneapi-construction-kit
View on GitHub
☆93May 1, 2026Updated 2 months ago
intel / intel-graphics-compiler
View on GitHub
☆710Updated this week
intel / level-zero-npu-extensions
View on GitHub
☆17Jul 15, 2026Updated last week
oneapi-src / level-zero
View on GitHub
oneAPI Level Zero Specification Headers and Loader
☆331Updated this week
intel / pti-gpu
View on GitHub
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…
☆270Jul 9, 2026Updated last week
intel / compute-runtime
View on GitHub
Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
☆1,419Updated this week
KhronosGroup / SYCL_Reference
View on GitHub
SYCL Reference Manual
☆30Feb 11, 2026Updated 5 months ago
ingowald / legacy-barney
View on GitHub
☆19Nov 2, 2025Updated 8 months ago
intel / torch-ccl
View on GitHub
oneCCL Bindings for Pytorch* (deprecated)
☆104Dec 31, 2025Updated 6 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
SC-SGS / Distributed_GPU_LSH_using_SYCL
View on GitHub
Distributed k-nearest Neighbors using Locality Sensitive Hashing and SYCL
☆10Updated this week
Apress / data-parallel-CPP
View on GitHub
Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…
☆288May 11, 2026Updated 2 months ago
HabanaAI / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆14Jan 8, 2026Updated 6 months ago
anadodik / sdmm-mitsuba
View on GitHub
Mitsuba Implementation of SDMM Path Guiding
☆18Mar 26, 2022Updated 4 years ago
intel / llvm-test-suite
View on GitHub
☆20Mar 27, 2023Updated 3 years ago
archibate / sycltutor
View on GitHub
小彭老师推出 SyCL 2020 课程（施工中，日后会在直播中放出）
☆15Sep 3, 2023Updated 2 years ago
libxsmm / tpp-mlir
View on GitHub
TPP experimentation on MLIR for linear algebra
☆155Updated this week
satishphd / Teaching-Intel-Intrinsics-for-SIMD-Parallelism
View on GitHub
Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class
☆19Feb 18, 2025Updated last year
codeplaysoftware / portDNN
View on GitHub
portDNN is a library implementing neural network algorithms written using SYCL
☆114May 21, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
oneapi-src / level-zero-tests
View on GitHub
oneAPI Level Zero Conformance & Performance test content
☆61Updated this week
sgl-project / sgl-kernel-xpu
View on GitHub
SGLang kernel library for Intel XPU
☆27Updated this week
intel / intel-xai-tools
View on GitHub
Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Releva…
☆39Sep 22, 2025Updated 9 months ago
oneapi-src / SYCLomatic
View on GitHub
☆290Updated this week
mingfeima / pytorch_profiler_parser
View on GitHub
parser script to process pytorch autograd profiler result, convert json file to excel.
☆15Oct 8, 2019Updated 6 years ago
flatironinstitute / sf_benchmarks
View on GitHub
Special function benchmarks
☆13Feb 22, 2024Updated 2 years ago
libxsmm / libxsmm
View on GitHub
Library for specialized dense and sparse matrix operations, and deep learning primitives.
☆968Updated this week