intel/sycl-tla

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/intel/sycl-tla)

intel / sycl-tla

SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs

☆78

Alternatives and similar repositories for sycl-tla

Users that are interested in sycl-tla are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

intel / torch-xpu-ops
View on GitHub
☆100Updated this week
intel / xetla
View on GitHub
☆61Dec 18, 2024Updated last year
intel / intel-xpu-backend-for-triton
View on GitHub
OpenAI Triton backend for Intel® GPUs
☆262Updated this week
intel / tiny-dpcpp-nn
View on GitHub
SYCL implementation of Fused MLPs for Intel GPUs
☆51Jul 17, 2026Updated last week
pengzhao-intel / oneAPI_course
View on GitHub
oneAPI - Data Parallel C++ course for students
☆43Nov 4, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
intel / metrics-discovery
View on GitHub
☆99Updated this week
intel / metrics-library
View on GitHub
☆19Jun 22, 2026Updated last month
oneapi-src / ishmem
View on GitHub
Intel® SHMEM - Device initiated shared memory based communication library
☆33Nov 12, 2025Updated 8 months ago
HabanaAI / gaudi-pytorch-bridge
View on GitHub
☆18Jul 13, 2026Updated last week
intel / pti-gpu
View on GitHub
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…
☆271Updated this week
intel / intel-extension-for-deepspeed
View on GitHub
Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…
☆65May 27, 2026Updated last month
sgl-project / sgl-kernel-xpu
View on GitHub
SGLang kernel library for Intel XPU
☆27Updated this week
intel / intel-xai-tools
View on GitHub
Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Releva…
☆39Sep 22, 2025Updated 10 months ago
xytpai / kfunca
View on GitHub
KFunca: A minimalist, high-performance GPU-based automatic differentiation framework
☆31Aug 14, 2025Updated 11 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
HabanaAI / vllm-fork
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆90Jul 13, 2026Updated last week
hkust-adsl / gass
View on GitHub
☆43Apr 3, 2022Updated 4 years ago
oneapi-src / level-zero-spec
View on GitHub
☆19Jun 26, 2026Updated last month
intel / llvm-test-suite
View on GitHub
☆20Mar 27, 2023Updated 3 years ago
intel / igsc
View on GitHub
Intel Graphics System Firmware Update Library (IGSC FUL) is a pure C low level library that exposes a required API to perform a firmware …
☆84Updated this week
intel / mlir-extensions
View on GitHub
Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.
☆153Updated this week
uxlfoundation / oneapi-construction-kit
View on GitHub
☆93May 1, 2026Updated 2 months ago
ORNL / HeCBench
View on GitHub
☆300Updated this week
meta-pytorch / triton-cpu
View on GitHub
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆48Aug 18, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
intel / level-zero-npu-extensions
View on GitHub
☆17Updated this week
coreyjadams / CosmicTagger
View on GitHub
Cosmic Tagging Network for Neutrino Physics
☆13Jun 26, 2024Updated 2 years ago
mmperf / mmperf
View on GitHub
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
☆138Sep 25, 2023Updated 2 years ago
argonne-lcf / Megatron-DeepSpeed
View on GitHub
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆17Mar 11, 2026Updated 4 months ago
huggingface / optimum-habana
View on GitHub
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
☆212Jul 6, 2026Updated 2 weeks ago
intel / video-streamer
View on GitHub
The repository contains a reference end-to-end pipeline for a real-time video analytics application. Realtime data is provided to an infe…
☆12Nov 3, 2025Updated 8 months ago
intel / xpumanager
View on GitHub
☆182Updated this week
xdslproject / training-intro
View on GitHub
Introduction to MLIR and xDSL training course
☆21Oct 2, 2023Updated 2 years ago
intel / gits
View on GitHub
API capture-replay tool for Vulkan, DirectX 12, OpenCL, Intel oneAPI Level Zero, and OpenGL
☆64Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
intel / auto-round
View on GitHub
A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support…
☆1,537Updated this week
Stardust-SJF / cuvs_rabitq
View on GitHub
cuVS - a library for vector search and clustering on the GPU. The IVF RaBitQ is under the cuvs_ivf_rabitq branch.
☆19Updated this week
uxlfoundation / oneCCL
View on GitHub
oneAPI Collective Communications Library (oneCCL)
☆268Updated this week
codeplaysoftware / portDNN
View on GitHub
portDNN is a library implementing neural network algorithms written using SYCL
☆114May 21, 2024Updated 2 years ago
VRGroupRWTH / mpi
View on GitHub
Header-only C++20 wrapper for MPI 4.0.
☆16Oct 20, 2023Updated 2 years ago
vllm-project / tml-fa4
View on GitHub
FA4-based Relative Attention Kernel developed by TML and Colfax
☆17Jul 17, 2026Updated last week
Multi-V-VM / DoubleJIT-VM
View on GitHub
A double JIT VM
☆23Jul 9, 2026Updated 2 weeks ago