☆61 Dec 18, 2024 Updated last year
Alternatives and similar repositories for xetla
Users that are interested in xetla are comparing it to the libraries listed below.
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs ☆72 Apr 24, 2026 Updated last week
- ☆60 Mar 6, 2026 Updated last month
- OpenAI Triton backend for Intel® GPUs ☆246 Apr 24, 2026 Updated last week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆65 Jun 30, 2025 Updated 10 months ago
- Intel® Tensor Processing Primitives extension for Pytorch* ☆18 Apr 9, 2026 Updated 3 weeks ago
- ☆164 Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆150 Apr 23, 2026 Updated last week
- SYCL implementation of Fused MLPs for Intel GPUs ☆51 Nov 24, 2025 Updated 5 months ago
- ☆152 Apr 23, 2026 Updated last week
- Intel® Extension for TensorFlow* ☆352 Oct 29, 2025 Updated 6 months ago
- OpenVINO LLM Benchmark ☆11 Dec 7, 2023 Updated 2 years ago
- ☆17 Updated this week
- ☆19 Apr 24, 2026 Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆259 Jan 13, 2025 Updated last year
- ☆701 Updated this week
- ☆94 Apr 2, 2026 Updated 3 weeks ago
- DeskVOX is a real-time visualization tool for 3D data sets like image stacks from CT or MRI scanners, or confocal microscopes. It has an … ☆21 Apr 24, 2026 Updated last week
- Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver ☆1,377 Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆264 Apr 9, 2026 Updated 3 weeks ago
- SYCL Reference Manual ☆30 Feb 11, 2026 Updated 2 months ago
- ☆19 Nov 2, 2025 Updated 5 months ago
- oneCCL Bindings for Pytorch* (deprecated) ☆104 Dec 31, 2025 Updated 4 months ago
- Distributed k-nearest Neighbors using Locality Sensitive Hashing and SYCL ☆10 Jun 7, 2021 Updated 4 years ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… ☆284 Mar 26, 2025 Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆14 Jan 8, 2026 Updated 3 months ago
- A general cubic equation solver and quartic equation minimisation solver written for CPU and Nvidia GPUs, for more details and results, s… ☆10 Jun 15, 2020 Updated 5 years ago
- ☆20 Mar 27, 2023 Updated 3 years ago
- Teacher Xiao Peng's SYCL 2020 course (under construction; to be released later via livestream) ☆15 Sep 3, 2023 Updated 2 years ago
- TPP experimentation on MLIR for linear algebra ☆147 Apr 23, 2026 Updated last week
- portDNN is a library implementing neural network algorithms written using SYCL ☆114 May 21, 2024 Updated last year
- Mitsuba Implementation of SDMM Path Guiding ☆18 Mar 26, 2022 Updated 4 years ago
- Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Releva… ☆39 Sep 22, 2025 Updated 7 months ago
- ☆284 Updated this week
- Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class ☆17 Feb 18, 2025 Updated last year
- ☆99 Mar 16, 2026 Updated last month
- parser script to process pytorch autograd profiler result, convert json file to excel. ☆15 Oct 8, 2019 Updated 6 years ago
- Special function benchmarks ☆13 Feb 22, 2024 Updated 2 years ago
- An MLIR frontend for tensor expressions ☆24 Sep 5, 2020 Updated 5 years ago
- Library for specialized dense and sparse matrix operations, and deep learning primitives. ☆949 Mar 18, 2026 Updated last month