onnx / onnx-xla
XLA integration of Open Neural Network Exchange (ONNX)
☆19Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for onnx-xla
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 4 years ago
- ☆67Updated last year
- Scoreboard for ONNX Backend Compatibility☆27Updated this week
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 8 years ago
- A sandbox for quick iteration and experimentation on projects related to IREE, MLIR, and LLVM☆55Updated 2 months ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 6 years ago
- Benchmarks to capture important workloads.☆28Updated 5 months ago
- ☆26Updated last year
- Fast matrix multiplication for few-bit integer matrices on CPUs.☆27Updated 5 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- GEMM and Winograd based convolutions using CUTLASS☆25Updated 4 years ago
- Experiments and prototypes associated with IREE or MLIR☆49Updated 3 months ago
- portDNN is a library implementing neural network algorithms written using SYCL☆108Updated 6 months ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆37Updated 9 months ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others.☆24Updated 5 years ago
- GraphDef Editor: A port of the TensorFlow contrib.graph_editor package that operates over serialized graphs☆31Updated 2 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆93Updated this week
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Codebase associated with the PyTorch compiler tutorial☆44Updated 5 years ago
- Benchmark scripts for TVM☆73Updated 2 years ago
- ☆12Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆26Updated 5 years ago
- Tests and benchmarks for cudnn (and in the future, other nvidia libraries)☆53Updated 4 years ago
- A Winograd Minimal Filter Implementation in CUDA☆23Updated 3 years ago
- An IR for efficiently simulating distributed ML computation.☆25Updated 10 months ago
- MLIRX is now defunct. Please see PolyBlocks - https://docs.polymagelabs.com☆38Updated 11 months ago
- Issues related to MLPerf™ Inference policies, including rules and suggested changes☆57Updated 2 weeks ago
- A tracing JIT compiler for PyTorch☆12Updated 2 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆49Updated 6 years ago