jax-ml / ml_dtypes
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
☆225Updated this week
Alternatives and similar repositories for ml_dtypes:
Users that are interested in ml_dtypes are comparing it to the libraries listed below
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆101Updated this week
- jax-triton contains integrations between JAX and OpenAI Triton☆358Updated this week
- JAX-Toolbox☆268Updated this week
- This repository contains the experimental PyTorch native float8 training UX☆213Updated 4 months ago
- ☆271Updated last week
- A library for unit scaling in PyTorch☆112Updated 2 weeks ago
- A library to analyze PyTorch traces.☆312Updated 2 weeks ago
- Named Tensors for Legible Deep Learning in JAX☆154Updated last week
- Orbax provides common checkpointing and persistence utilities for JAX users☆314Updated this week
- extensible collectives library in triton☆73Updated 2 months ago
- TorchFix - a linter for PyTorch-using code with autofix support☆111Updated this week
- OpTree: Optimized PyTree Utilities☆156Updated this week
- ☆49Updated 4 months ago
- Experiment of using Tangent to autodiff triton☆72Updated 10 months ago
- ☆150Updated 6 months ago
- ☆178Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆150Updated last week
- ☆97Updated last week
- Multidimensional indexing for tensors☆115Updated last year
- seqax = sequence modeling + JAX☆136Updated 5 months ago
- JMP is a Mixed Precision library for JAX.☆188Updated last week
- PyTorch centric eager mode debugger☆43Updated 5 months ago
- A simple library for scaling up JAX programs☆128Updated last month
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆281Updated this week
- Implementation of Flash Attention in Jax☆201Updated 9 months ago
- Applied AI experiments and examples for PyTorch☆182Updated last week
- Fast low-bit matmul kernels in Triton☆161Updated last week
- NVIDIA Math Libraries for the Python Ecosystem☆214Updated 3 weeks ago
- Scalable and Performant Data Loading☆184Updated this week
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆109Updated last year