jax-ml / ml_dtypes
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
☆328 Updated last week
Alternatives and similar repositories for ml_dtypes
Users who are interested in ml_dtypes compare it to the libraries listed below.
- ☆344 Updated 3 weeks ago
- TorchFix - a linter for PyTorch-using code with autofix support ☆152 Updated 5 months ago
- OpTree: Optimized PyTree Utilities ☆205 Updated last month
- jax-triton contains integrations between JAX and OpenAI Triton ☆437 Updated last month
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆130 Updated last week
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate. ☆732 Updated this week
- ☆189 Updated last year
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind… ☆164 Updated 3 weeks ago
- This repository contains the experimental PyTorch native float8 training UX ☆227 Updated last year
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic … ☆104 Updated this week
- ☆192 Updated 3 weeks ago
- JAX-Toolbox ☆381 Updated last week
- Home for OctoML PyTorch Profiler ☆113 Updated 2 years ago
- ☆21 Updated 11 months ago
- Tokamax: A GPU and TPU kernel library. ☆169 Updated this week
- extensible collectives library in triton ☆93 Updated 10 months ago
- Experiment of using Tangent to autodiff triton ☆82 Updated 2 years ago
- TORCH_TRACE parser for PT2 ☆75 Updated this week
- A library for unit scaling in PyTorch ☆133 Updated 6 months ago
- ☆28 Updated last year
- Named Tensors for Legible Deep Learning in JAX ☆218 Updated 2 months ago
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA… ☆171 Updated this week
- Implementation of Flash Attention in Jax ☆225 Updated last year
- High-Performance FP32 GEMM on CUDA devices ☆117 Updated last year
- ☆55 Updated last year
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆155 Updated 2 years ago
- Orbax provides common checkpointing and persistence utilities for JAX users ☆479 Updated this week
- PyTorch RFCs (experimental) ☆138 Updated 8 months ago
- PyTorch centric eager mode debugger ☆48 Updated last year
- A library to analyze PyTorch traces. ☆464 Updated last week