graphcore-research / gfloat
Generic floating-point types in Python
☆12Updated this week
Alternatives and similar repositories for gfloat:
Users that are interested in gfloat are comparing it to the libraries listed below
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆45Updated 8 months ago
- ☆51Updated 7 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆109Updated last month
- TORCH_LOGS parser for PT2☆35Updated last week
- ☆21Updated 3 weeks ago
- Personal solutions to the Triton Puzzles☆18Updated 8 months ago
- ☆27Updated 2 months ago
- Explore training for quantized models☆17Updated 2 months ago
- Experiment of using Tangent to autodiff triton☆78Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆255Updated this week
- A lightweight, Pythonic, frontend for MLIR☆80Updated last year
- Tensor Parallelism with JAX + Shard Map☆11Updated last year
- Unit Scaling demo and experimentation code☆16Updated last year
- A library for unit scaling in PyTorch☆124Updated 3 months ago
- ☆20Updated last year
- ☆36Updated 3 months ago
- JAX for Graphcore IPU (experimental)☆21Updated last year
- The official, proof-of-concept C++ implementation of PocketNN.☆32Updated 9 months ago
- If it quacks like a tensor...☆57Updated 4 months ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆19Updated 7 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated last year
- Implementation of PSGD optimizer in JAX☆29Updated 2 months ago
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- ☆38Updated last year
- An implementation of the Llama architecture, to instruct and delight☆21Updated 2 months ago
- extensible collectives library in triton☆84Updated 6 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 3 months ago
- ☆15Updated 11 months ago
- The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"☆18Updated 8 months ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆14Updated 4 months ago