intel / intel-extension-for-tensorflow
Intel® Extension for TensorFlow*
☆329Updated last month
Alternatives and similar repositories for intel-extension-for-tensorflow:
Users that are interested in intel-extension-for-tensorflow are comparing it to the libraries listed below
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆349Updated this week
- OpenAI Triton backend for Intel® GPUs☆165Updated this week
- Backward compatible ML compute opset inspired by HLO/MHLO☆446Updated this week
- ☆248Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,737Updated this week
- A collection of examples for the ROCm software stack☆185Updated this week
- ☆105Updated 3 months ago
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it☆499Updated 2 weeks ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆217Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆552Updated this week
- oneAPI Collective Communications Library (oneCCL)☆222Updated 3 weeks ago
- AMD's graph optimization engine.☆208Updated this week
- CUDA Kernel Benchmarking Library☆560Updated 2 months ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆443Updated this week
- oneAPI Level Zero Specification Headers and Loader☆237Updated this week
- ☆406Updated this week
- ☆43Updated last week
- GPUOcelot: A dynamic compilation framework for PTX☆166Updated last week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆60Updated 2 months ago
- oneAPI Technical Advisory Board (TAB) Meeting Notes☆72Updated last year
- ROCm Communication Collectives Library (RCCL)☆297Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆424Updated last year
- An open-source efficient deep learning framework/compiler, written in python.☆681Updated last week
- oneAPI Specification source files☆195Updated 2 weeks ago
- oneCCL Bindings for Pytorch*☆88Updated last month
- ☆60Updated 2 months ago
- ☆81Updated this week
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆606Updated 3 months ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week