Intel-tensorflow / tensorflowLinks
Computation using data flow graphs for scalable machine learning
☆67Updated this week
Alternatives and similar repositories for tensorflow
Users that are interested in tensorflow are comparing it to the libraries listed below
Sorting:
- oneAPI Collective Communications Library (oneCCL)☆237Updated last week
- To make it easy to benchmark AI accelerators☆184Updated 2 years ago
- oneCCL Bindings for Pytorch*☆97Updated 2 months ago
- ☆30Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆245Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆227Updated 3 weeks ago
- ☆416Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 3 months ago
- OpenAI Triton backend for Intel® GPUs☆191Updated this week
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆98Updated 3 years ago
- heterogeneity-aware-lowering-and-optimization☆254Updated last year
- AMD's graph optimization engine.☆223Updated this week
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Python bindings for NVTX☆66Updated 2 years ago
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆246Updated 2 weeks ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆337Updated this week
- ☆62Updated 6 months ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated last year
- ☆194Updated 2 years ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆152Updated this week
- ROCm Communication Collectives Library (RCCL)☆342Updated this week
- ☆108Updated last week
- A CUTLASS implementation using SYCL☆27Updated last week
- ☆20Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆116Updated last year
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 2 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆83Updated 2 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated 2 months ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆133Updated last year
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆62Updated this week