Intel-tensorflow / tensorflow
Computation using data flow graphs for scalable machine learning
☆67Updated this week
Alternatives and similar repositories for tensorflow:
Users that are interested in tensorflow are comparing it to the libraries listed below
- oneAPI Collective Communications Library (oneCCL)☆232Updated 2 weeks ago
- oneCCL Bindings for Pytorch*☆94Updated last week
- To make it easy to benchmark AI accelerators☆182Updated 2 years ago
- AMD's graph optimization engine.☆215Updated this week
- Python bindings for NVTX☆66Updated last year
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆109Updated 2 years ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆62Updated last month
- Stretching GPU performance for GEMMs and tensor contractions.☆235Updated last week
- heterogeneity-aware-lowering-and-optimization☆255Updated last year
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆98Updated 3 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆94Updated this week
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆236Updated this week
- ☆410Updated this week
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated last year
- DeepLearning Framework Performance Profiling Toolkit☆285Updated 3 years ago
- OpenAI Triton backend for Intel® GPUs☆179Updated this week
- Tools and extensions for CUDA profiling☆65Updated 5 years ago
- ☆339Updated last year
- ROCm Communication Collectives Library (RCCL)☆317Updated this week
- ☆30Updated 2 years ago
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆143Updated last week
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆62Updated last month
- RDMA and SHARP plugins for nccl library☆189Updated 2 weeks ago
- Benchmarks to capture important workloads.☆31Updated 2 months ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆396Updated 3 months ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆116Updated last year
- ☆60Updated 4 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated 2 weeks ago
- Compiler Infrastructure for Neural Networks☆145Updated last year