r0mainK / outperformerLinks

Code for scaling Transformers

☆26

Alternatives and similar repositories for outperformer

Users that are interested in outperformer are comparing it to the libraries listed below

Sorting:

prajjwal1 / fluence
A deep learning library based on Pytorch focussed on low resource language research and robustness
☆70Updated 3 years ago
pytorch-tpu / examples
This repository contains example code to build models on TPUs
☆30Updated 2 years ago
lucidrains / g-mlp-gpt
GPT, but made only out of MLPs
☆89Updated 4 years ago
giannisdaras / smyrf
[NeurIPS 2020] Official Implementation: "SMYRF: Efficient Attention using Asymmetric Clustering".
☆50Updated last year
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58Updated 4 years ago
YeonwooSung / GLOM
PyTorch implementation of GLOM
☆22Updated 3 years ago
yk / PyTorch_CIFAR10
Pretrained TorchVision models on CIFAR10 dataset (with weights)
☆24Updated 4 years ago
iKernels / transformers-lightning
A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…
☆47Updated 2 years ago
ducha-aiki / hardnet-in-fastai2-and-kornia
Re-implementation of local descriptor HardNet training in fasta2+kornia
☆21Updated 5 years ago
harvardnlp / hmm-lm
☆41Updated 4 years ago
tensorfork / OBST
Your fruity companion for transformers
☆14Updated 3 years ago
ofirpress / shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147Updated 3 years ago
kingoflolz / cc_img_dl
☆27Updated 4 years ago
Popgun-Labs / PopGen
A generative modelling toolkit for PyTorch.
☆70Updated 3 years ago
cgraywang / transformer-on-diet
Code repo for "Transformer on a Diet" paper
☆31Updated 5 years ago
antofuller / configaformers
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆49Updated 3 years ago
kyunghyuncho / backprop-kalman-filter
☆45Updated 5 years ago
Holmeswww / PPOGAN
☆24Updated last year
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67Updated 2 years ago
sayakpaul / EvoNorms-in-TensorFlow-2
Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.
☆11Updated 5 years ago
stanis-morozov / prodige
A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.
☆47Updated 5 years ago
TimDettmers / transformer-xl
☆64Updated 5 years ago
eyalbd2 / RL-based-Language-Modeling
☆13Updated 6 years ago
lucidrains / feedback-transformer-pytorch
Implementation of Feedback Transformer in Pytorch
☆107Updated 4 years ago
TomFrederik / grokking
Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'
☆38Updated 3 years ago
lnsmith54 / BOSS
This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F…
☆36Updated 4 years ago
rajammanabrolu / Q-BERT
Agents that build knowledge graphs and explore textual worlds by asking questions
☆79Updated last year
theblackcat102 / H5Record
Large dataset storage format for Pytorch
☆45Updated 3 years ago
ClashLuke / PerfTorch
High performance pytorch modules
☆18Updated 2 years ago
nvecoven / BRC
A repository containing the code for the Bistable Recurrent Cell
☆47Updated 4 years ago