Intel-tensorflow / tensorflow
Computation using data flow graphs for scalable machine learning
☆68 Updated this week
Alternatives and similar repositories for tensorflow
Users who are interested in tensorflow are comparing it to the libraries listed below.
- oneCCL Bindings for Pytorch* ☆99 Updated 3 weeks ago
- oneAPI Collective Communications Library (oneCCL) ☆241 Updated 3 weeks ago
- AMD's graph optimization engine. ☆235 Updated this week
- heterogeneity-aware-lowering-and-optimization ☆255 Updated last year
- ☆420 Updated last week
- To make it easy to benchmark AI accelerators ☆185 Updated 2 years ago
- DeepLearning Framework Performance Profiling Toolkit ☆285 Updated 3 years ago
- ☆115 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆247 Updated this week
- Python bindings for NVTX ☆66 Updated 2 years ago
- Training material for Nsight developer tools ☆163 Updated 11 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆345 Updated this week
- Example of using pytorch's open device registration API ☆30 Updated 2 years ago
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver… ☆254 Updated last month
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of… ☆163 Updated this week
- ☆362 Updated last year
- Issues related to MLPerf™ training policies, including rules and suggested changes ☆95 Updated last week
- ☆62 Updated 7 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆444 Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆61 Updated last month
- Efficient Top-K implementation on the GPU ☆183 Updated 6 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking. ☆98 Updated 3 years ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆427 Updated last week
- OpenAI Triton backend for Intel® GPUs ☆197 Updated this week
- This repository contains the results and code for the MLPerf™ Inference v1.0 benchmark. ☆32 Updated last week
- A performant and modular runtime for TensorFlow ☆758 Updated this week
- Issues related to MLPerf™ Inference policies, including rules and suggested changes ☆63 Updated last week
- Assembler for NVIDIA Volta and Turing GPUs ☆226 Updated 3 years ago
- ☆196 Updated 2 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels. ☆134 Updated last year