graphcore / examplesLinks
Example code and applications for machine learning on Graphcore IPUs
☆323Updated last year
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below
Sorting:
- PyTorch interface for the IPU☆180Updated last year
- TensorFlow for the IPU☆78Updated last year
- Poplar Advanced Runtime for the IPU☆7Updated last year
- Training material for IPU users: tutorials, feature examples, simple applications☆86Updated 2 years ago
- Poplar libraries☆119Updated last year
- PyTorch RFCs (experimental)☆133Updated last month
- Training neural networks in TensorFlow 2.0 with 5x less memory☆132Updated 3 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆259Updated 2 years ago
- Research and development for optimizing transformers☆129Updated 4 years ago
- A library of GPU kernels for sparse matrix operations.☆270Updated 4 years ago
- Block-sparse primitives for PyTorch☆157Updated 4 years ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆166Updated last week
- Issues related to MLPerf™ training policies, including rules and suggested changes☆95Updated last week
- oneCCL Bindings for Pytorch*☆99Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆158Updated 3 weeks ago
- ☆251Updated 11 months ago
- Torch Distributed Experimental☆116Updated 11 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆343Updated this week
- ☆169Updated last year
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆57Updated 2 years ago
- An open-source efficient deep learning framework/compiler, written in python.☆708Updated this week
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆38Updated last year
- Python bindings for NVTX☆67Updated 2 years ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆280Updated this week
- Distributed preprocessing and data loading for language datasets☆39Updated last year
- Fast sparse deep learning on CPUs☆53Updated 2 years ago
- Implementation of a Transformer, but completely in Triton☆270Updated 3 years ago
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …☆194Updated 2 years ago
- Butterfly matrix multiplication in PyTorch☆172Updated last year