learning-at-home / lean_transformerLinks

Memory-efficient transformer. Work in progress.

☆19

Alternatives and similar repositories for lean_transformer

Users that are interested in lean_transformer are comparing it to the libraries listed below

Sorting:

lucidrains / einops-exts
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
☆55Updated 2 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
ColinQiyangLi / AdaCat
AdaCat
☆49Updated 3 years ago
lucidrains / local-attention-flax
Local Attention - Flax module for Jax
☆22Updated 4 years ago
facebookresearch / grounding-inductive-biases
reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"
☆17Updated last year
ClashLuke / PerfTorch
High performance pytorch modules
☆18Updated 2 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago
lucidrains / compositional-attention-pytorch
Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…
☆51Updated 3 years ago
lucidrains / product-key-memory
Standalone Product Key Memory module in Pytorch - for augmenting Transformer models
☆83Updated last year
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆33Updated 2 years ago
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆60Updated last week
google-research / precondition
☆31Updated 4 months ago
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆38Updated 3 years ago
antofuller / configaformers
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆49Updated 3 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58Updated 4 years ago
MathInf / toroidal
a lightweight transformer library for PyTorch
☆72Updated 3 years ago
AranKomat / Metroplex
☆21Updated 2 years ago
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25Updated 4 years ago
pytorch / maskedtensor
MaskedTensors for PyTorch
☆38Updated 3 years ago
EleutherAI / equivariance
A framework for implementing equivariant DL
☆10Updated 4 years ago
iKernels / transformers-lightning
A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…
☆47Updated 2 years ago
nestordemeure / flaxOptimizers
A collection of optimizers, some arcane others well known, for Flax.
☆29Updated 4 years ago
lucidrains / esbn-transformer
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16Updated 4 years ago
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆51Updated last year
mryab / learning-at-home
"Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts" (NeurIPS 2020), original PyTorch implemen…
☆56Updated 4 years ago
lucidrains / kronecker-attention-pytorch
Implementation of Kronecker Attention in Pytorch
☆19Updated 5 years ago
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆45Updated 2 years ago
ahennequ / pytorch-custom-mma
☆29Updated 3 years ago
lucidrains / kalman-filtering-attention
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆59Updated last year