mingfeima / pytorch_profiler_parserLinks
parser script to process pytorch autograd profiler result, convert json file to excel.
☆14Updated 6 years ago
Alternatives and similar repositories for pytorch_profiler_parser
Users that are interested in pytorch_profiler_parser are comparing it to the libraries listed below
Sorting:
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆18Updated 6 years ago
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆24Updated 7 months ago
- oneCCL Bindings for Pytorch* (deprecated)☆103Updated last month
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆13Updated 8 months ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆140Updated 2 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.☆40Updated 5 years ago
- Research and development for optimizing transformers☆131Updated 4 years ago
- ☆50Updated 6 years ago
- System for automated integration of deep learning backends.☆47Updated 3 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆122Updated 3 years ago
- ☆83Updated 3 years ago
- A tool for examining GPU scheduling behavior.☆90Updated last year
- Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)☆146Updated 5 years ago
- Python bindings for NVTX☆67Updated 2 years ago
- Automated machine learning as an AI-HPC benchmark☆65Updated 3 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆180Updated 3 years ago
- ☆109Updated last year
- ☆42Updated 2 years ago
- ☆32Updated 3 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 7 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆134Updated 5 years ago
- Artifacts of EVT ASPLOS'24☆28Updated last year
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆43Updated 3 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆154Updated last week
- FTPipe and related pipeline model parallelism research.☆43Updated 2 years ago
- ☆23Updated 3 months ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆200Updated 3 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆90Updated 3 years ago
- A hierarchical collective communications library with portable optimizations☆37Updated last year
- A recommendation model kernel optimizing system☆12Updated 6 months ago