jiazhihao / TASOLinks
The Tensor Algebra SuperOptimizer for Deep Learning
☆730Updated 2 years ago
Alternatives and similar repositories for TASO
Users that are interested in TASO are comparing it to the libraries listed below
Sorting:
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆994Updated last year
- Dive into Deep Learning Compiler☆648Updated 3 years ago
- ☆194Updated 2 years ago
- TVM integration into PyTorch☆454Updated 5 years ago
- ☆421Updated this week
- ☆243Updated 2 months ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆921Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,444Updated last week
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆200Updated 3 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆895Updated 9 months ago
- Place for meetup slides☆140Updated 4 years ago
- ☆145Updated 8 months ago
- common in-memory tensor structure☆1,072Updated 3 weeks ago
- Symbolic Expression and Statement Module for new DSLs☆206Updated 4 years ago
- ☆392Updated 2 years ago
- heterogeneity-aware-lowering-and-optimization☆256Updated last year
- row-major matmul optimization☆672Updated last month
- A performant and modular runtime for TensorFlow☆760Updated 3 weeks ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆180Updated 3 years ago
- System for automated integration of deep learning backends.☆47Updated 3 years ago
- Experimental projects related to TensorRT☆112Updated this week
- A tensor-aware point-to-point communication primitive for machine learning☆273Updated last month
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆122Updated 3 years ago
- A library of GPU kernels for sparse matrix operations.☆273Updated 4 years ago
- Assembler for NVIDIA Volta and Turing GPUs☆230Updated 3 years ago
- A model compilation solution for various hardware☆450Updated last month
- Backward compatible ML compute opset inspired by HLO/MHLO☆543Updated last week
- Shared Middle-Layer for Triton Compilation☆288Updated last week
- Fast CUDA Kernels for ResNet Inference.☆180Updated 6 years ago
- Repository for SysML19 Artifacts Evaluation☆54Updated 6 years ago