CODAIT / graph_def_editor
GraphDef Editor: A port of the TensorFlow contrib.graph_editor package that operates over serialized graphs
☆31Updated last year
Related projects: ⓘ
- Python bindings for NVTX☆66Updated last year
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆56Updated last year
- distributed-embeddings is a library for building large embedding based models in Tensorflow 2.☆41Updated 11 months ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆97Updated 2 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆247Updated last year
- oneCCL Bindings for Pytorch*☆83Updated last week
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- Convert nvprof profiles into about:tracing compatible JSON files☆67Updated 3 years ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆55Updated 2 months ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆108Updated 10 months ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆61Updated 2 years ago
- The Triton backend for the PyTorch TorchScript models.☆117Updated last week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆91Updated last year
- An analytical performance modeling tool for deep neural networks.☆85Updated 3 years ago
- DLPack for Tensorflow☆36Updated 4 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆28Updated 2 years ago
- Documentation for StreamExecutor open source proposal☆83Updated 8 years ago
- Simple Distributed Deep Learning on TensorFlow☆134Updated last year
- ☆47Updated last year
- Lightweight and Parallel Deep Learning Framework☆261Updated last year
- PyProf2: PyTorch Profiling tool☆83Updated 4 years ago
- Home for OctoML PyTorch Profiler☆105Updated last year
- Benchmarks to capture important workloads.☆28Updated 3 months ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆63Updated 6 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆127Updated 2 years ago
- PyTorch RFCs (experimental)☆120Updated 3 weeks ago
- Python bindings for UCX☆120Updated this week
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 4 years ago
- Codebase associated with the PyTorch compiler tutorial☆44Updated 5 years ago
- An IR for efficiently simulating distributed ML computation.☆24Updated 8 months ago