facebookexperimental / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆17Updated 5 years ago
Alternatives and similar repositories for tvm:
Users that are interested in tvm are comparing it to the libraries listed below
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 6 years ago
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- Benchmarks to capture important workloads.☆31Updated 2 months ago
- ☆40Updated 4 months ago
- ☆13Updated 3 years ago
- An experimental ahead of time compiler for Relay.☆50Updated 5 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆28Updated 5 years ago
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆26Updated this week
- Prototype routines for GPU quantization written using PyTorch.☆21Updated 2 months ago
- benchmarking some transformer deployments☆26Updated 2 years ago
- ☆23Updated last year
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆28Updated 3 years ago
- ☆16Updated 2 years ago
- An IR for efficiently simulating distributed ML computation.☆28Updated last year
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆27Updated 6 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- ☆11Updated 4 years ago
- Scoreboard for ONNX Backend Compatibility☆29Updated this week
- DLPack for Tensorflow☆35Updated 5 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance☆17Updated 2 years ago
- ☆36Updated 2 years ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆62Updated 2 months ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆136Updated this week
- An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search☆23Updated 5 years ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 2 years ago
- Visualize TVM Relay program graph☆12Updated 5 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆98Updated 3 years ago
- ☆60Updated last year
- A domain-specific language and compiler for image processing☆76Updated 4 years ago