openxla / xprofLinks
A profiling and performance analysis tool for machine learning
☆413Updated this week
Alternatives and similar repositories for xprof
Users that are interested in xprof are comparing it to the libraries listed below
Sorting:
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆733Updated this week
- A performant and modular runtime for TensorFlow☆759Updated 2 weeks ago
- Continuous builder and binary build scripts for pytorch☆352Updated 2 weeks ago
- Guide for building custom op for TensorFlow☆382Updated 2 years ago
- PyTorch RFCs (experimental)☆134Updated 2 months ago
- The TensorFlow Cloud repository provides APIs that will allow to easily go from debugging and training your Keras and TensorFlow code in …☆381Updated 3 weeks ago
- TensorFlow/TensorRT integration☆743Updated last year
- A tensor-aware point-to-point communication primitive for machine learning☆262Updated last week
- ☆419Updated this week
- A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.☆1,548Updated 2 weeks ago
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆851Updated this week
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆383Updated this week
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,428Updated this week
- Example code and applications for machine learning on Graphcore IPUs☆326Updated last year
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆97Updated this week
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆976Updated this week
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…☆716Updated last week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,059Updated last year
- A GPU performance profiling tool for PyTorch models☆504Updated 4 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆349Updated this week
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆484Updated 3 weeks ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated 2 months ago
- PyTorch elastic training☆729Updated 3 years ago
- Python bindings for NVTX☆66Updated 2 years ago
- PyTorch C++ API Documentation☆234Updated this week
- PyTorch interface for the IPU☆180Updated last year
- Providing reproducibility in deep learning frameworks☆428Updated last year
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆157Updated 11 months ago
- Backward compatible ML compute opset inspired by HLO/MHLO☆525Updated this week