skylineprof / skylineLinks
π Interactive in-editor performance profiling, visualization, and debugging for PyTorch neural networks.
β32Updated 3 years ago
Alternatives and similar repositories for skyline
Users that are interested in skyline are comparing it to the libraries listed below
Sorting:
- MONeT framework for reducing memory consumption of DNN trainingβ174Updated 4 years ago
- PyTorch implementation of L2L execution algorithmβ109Updated 3 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memoryβ137Updated 3 years ago
- Lightweight and Parallel Deep Learning Frameworkβ264Updated 3 years ago
- A GPU performance profiling tool for PyTorch modelsβ510Updated 4 years ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.β250Updated 3 years ago
- Programmable Neural Network Compressionβ149Updated 3 years ago
- PyProf2: PyTorch Profiling toolβ82Updated 5 years ago
- [ICLR 2020] Drawing Early-Bird Tickets: Toward More Efficient Training of Deep Networksβ140Updated 5 years ago
- Research and development for optimizing transformersβ131Updated 4 years ago
- Train ImageNet in 18 minutes on AWSβ134Updated last year
- ParaDnn: A systematic performance analysis methodology for deep learning.β40Updated 5 years ago
- Block-sparse primitives for PyTorchβ158Updated 4 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilationβ27Updated 6 years ago
- A tensor-aware point-to-point communication primitive for machine learningβ283Updated last month
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616β133Updated 2 years ago
- This repository contains the results and code for the MLPerfβ’ Training v0.7 benchmark.β57Updated 2 years ago
- β145Updated 2 years ago
- End-to-end training of sparse deep neural networks with little-to-no performance loss.β335Updated 3 years ago
- Code for reproducing work of ICML 2019 paper: Memory-Optimal Direct Convolutions for Maximizing Classification Accuracy in Embedded Appliβ¦β12Updated 6 years ago
- β41Updated last year
- Deadline-based hyperparameter tuning on RayTune.β32Updated 6 years ago
- Butterfly matrix multiplication in PyTorchβ178Updated 2 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compreβ¦β373Updated 3 weeks ago
- Customized matrix multiplication kernelsβ57Updated 3 years ago
- β58Updated 3 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large β¦β65Updated 3 years ago
- β107Updated 4 years ago
- Benchmark PyTorch Custom Operatorsβ14Updated 2 years ago
- Using ideas from product quantization for state-of-the-art neural network compression.β145Updated 4 years ago