skylineprof / skylineLinks
π Interactive in-editor performance profiling, visualization, and debugging for PyTorch neural networks.
β33Updated 2 years ago
Alternatives and similar repositories for skyline
Users that are interested in skyline are comparing it to the libraries listed below
Sorting:
- MONeT framework for reducing memory consumption of DNN trainingβ173Updated 4 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memoryβ132Updated 3 years ago
- PyTorch implementation of L2L execution algorithmβ107Updated 2 years ago
- Research and development for optimizing transformersβ129Updated 4 years ago
- This repository contains the results and code for the MLPerfβ’ Training v0.7 benchmark.β57Updated 2 years ago
- Block-sparse primitives for PyTorchβ157Updated 4 years ago
- β40Updated 7 months ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616β132Updated 2 years ago
- A tensor-aware point-to-point communication primitive for machine learningβ259Updated 2 years ago
- Lightweight and Parallel Deep Learning Frameworkβ264Updated 2 years ago
- System for automated integration of deep learning backends.β47Updated 2 years ago
- PyTorch interface for the IPUβ180Updated last year
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large β¦β65Updated 3 years ago
- A GPU performance profiling tool for PyTorch modelsβ503Updated 4 years ago
- A schedule language for large model trainingβ149Updated last year
- β57Updated 3 years ago
- Torch Distributed Experimentalβ116Updated 11 months ago
- Butterfly matrix multiplication in PyTorchβ172Updated last year
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.β251Updated 2 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)β¦β29Updated 3 years ago
- End-to-end training of sparse deep neural networks with little-to-no performance loss.β323Updated 2 years ago
- Benchmark PyTorch Custom Operatorsβ14Updated 2 years ago
- β43Updated last year
- Customized matrix multiplication kernelsβ56Updated 3 years ago
- DLPack for Tensorflowβ35Updated 5 years ago
- β22Updated 4 years ago
- Programmable Neural Network Compressionβ148Updated 3 years ago
- PyProf2: PyTorch Profiling toolβ82Updated 5 years ago
- β251Updated 11 months ago
- FTPipe and related pipeline model parallelism research.β41Updated 2 years ago