teddykoker / tinyloaderLinks
☆68Updated 9 months ago
Alternatives and similar repositories for tinyloader
Users that are interested in tinyloader are comparing it to the libraries listed below
Sorting:
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Updated 2 years ago
- Fast Discounted Cumulative Sums in PyTorch☆96Updated 4 years ago
- ML/DL Math and Method notes☆65Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 4 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Updated 4 years ago
- Functional deep learning☆108Updated 3 years ago
- Amos optimizer with JEstimator lib.☆82Updated last year
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆67Updated 2 years ago
- Implementation of Flash Attention in Jax☆223Updated last year
- Machine Learning eXperiment Utilities☆46Updated 5 months ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆59Updated 2 years ago
- ☆118Updated 3 weeks ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆91Updated last year
- JAX implementation of Learning to learn by gradient descent by gradient descent☆28Updated 5 months ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated last year
- ☆31Updated last month
- ☆36Updated 3 years ago
- a lightweight transformer library for PyTorch☆72Updated 4 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Updated 7 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆189Updated 3 years ago
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- Toy implementations of some popular ML optimizers using Python/JAX☆44Updated 4 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆116Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆69Updated 3 weeks ago