shawwn / ml-notes
β39Updated 2 years ago
Alternatives and similar repositories for ml-notes:
Users that are interested in ml-notes are comparing it to the libraries listed below
- Python Research Frameworkβ106Updated 2 years ago
- A GPT, made only of MLPs, in Jaxβ57Updated 3 years ago
- π Pytorch code for the Nero optimiser.β20Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'β38Updated 3 years ago
- Image augmentation library for Jaxβ37Updated 10 months ago
- A case study of efficient training of large language models using commodity hardware.β68Updated 2 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variationβ12Updated 3 years ago
- My implementation of DeepMind's Perceiverβ61Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Trainingβ48Updated last year
- A JAX implementation of stochastic addition.β12Updated 2 years ago
- Babysit your preemptible TPUsβ85Updated 2 years ago
- A simple Transformer where the softmax has been replaced with normalizationβ19Updated 4 years ago
- A port of muP to JAX/Haikuβ25Updated 2 years ago
- β57Updated 2 years ago
- A stateful pytree library for training neural networks.β21Updated 2 years ago
- A collection of optimizers, some arcane others well known, for Flax.β29Updated 3 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodesβ237Updated last year
- A selection of neural network models ported from torchvision for JAX & Flax.β44Updated 4 years ago
- How to use the Flax Linen API to build a convolutional neural network model and train it for image classification (using TensorFlow Datasβ¦β24Updated last year
- β67Updated last year
- A framework for implementing equivariant DLβ10Updated 3 years ago
- Automatically take good care of your preemptible TPUsβ36Updated last year
- β153Updated 4 years ago
- Framework-agnostic library for checking array/tensor shapes at runtime.β47Updated 3 years ago
- This repository contains example code to build models on TPUsβ30Updated last year
- β108Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimatorβ31Updated last year
- Amos optimizer with JEstimator lib.β81Updated 9 months ago
- Check if you have training samples in your test setβ64Updated 2 years ago
- Your fruity companion for transformersβ14Updated 2 years ago