teddykoker / tinyloaderLinks
☆64Updated 3 months ago
Alternatives and similar repositories for tinyloader
Users that are interested in tinyloader are comparing it to the libraries listed below
Sorting:
- JAX implementation of Learning to learn by gradient descent by gradient descent☆27Updated 8 months ago
- Machine Learning eXperiment Utilities☆46Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆58Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 3 weeks ago
- some common Huggingface transformers in maximal update parametrization (µP)☆81Updated 3 years ago
- Tensor Parallelism with JAX + Shard Map☆11Updated last year
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 3 years ago
- This is a port of Mistral-7B model in JAX☆32Updated 11 months ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated last year
- A GPT, made only of MLPs, in Jax☆58Updated 4 years ago
- PyTorch implementation of GLOM☆22Updated 3 years ago
- code for the ddp tutorial☆32Updated 3 years ago
- My explorations into editing the knowledge and memories of an attention network☆35Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆56Updated last week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- A selection of neural network models ported from torchvision for JAX & Flax.☆44Updated 4 years ago
- AdaCat☆49Updated 2 years ago
- Large dataset storage format for Pytorch☆45Updated 3 years ago
- Lightning-like training API for JAX with Flax☆41Updated 6 months ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Updated last year
- Fast Discounted Cumulative Sums in PyTorch☆96Updated 3 years ago
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆67Updated last year
- ☆74Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Simply Numpy implementation of the FAVOR+ attention mechanism, https://teddykoker.com/2020/11/performers/☆38Updated 4 years ago
- Various transformers for FSDP research☆37Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆33Updated last year