graphcore / tensorflow
TensorFlow for the IPU
☆78Updated last year
Alternatives and similar repositories for tensorflow:
Users that are interested in tensorflow are comparing it to the libraries listed below
- Poplar Advanced Runtime for the IPU☆6Updated last year
- Poplar libraries☆117Updated last year
- PyTorch interface for the IPU☆177Updated last year
- Graph algorithms for machine learning frameworks☆27Updated last year
- Example code and applications for machine learning on Graphcore IPUs☆320Updated 11 months ago
- Training material for IPU users: tutorials, feature examples, simple applications☆86Updated last year
- Python bindings for NVTX☆66Updated last year
- MLIR-based partitioning system☆62Updated this week
- oneCCL Bindings for Pytorch*☆89Updated last month
- Assembler for NVIDIA Volta and Turing GPUs☆212Updated 3 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆32Updated 4 years ago
- ☆48Updated 11 months ago
- OpenAI Triton backend for Intel® GPUs☆165Updated this week
- Stores documents and resources used by the OpenXLA developer community☆117Updated 6 months ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆127Updated last year
- Issues related to MLPerf™ training policies, including rules and suggested changes☆94Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆130Updated this week
- A home for the final text of all TVM RFCs.☆102Updated 4 months ago
- ☆406Updated this week
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81Updated 2 years ago
- Shared Middle-Layer for Triton Compilation☆226Updated this week
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆64Updated 6 years ago
- oneAPI Collective Communications Library (oneCCL)☆222Updated 3 weeks ago
- NCCL Examples from Official NVIDIA NCCL Developer Guide.☆15Updated 6 years ago
- IREE plugin repository for the AMD AIE accelerator☆79Updated this week
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆32Updated this week
- System for automated integration of deep learning backends.☆48Updated 2 years ago
- ☆87Updated 10 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week