IpsumDominum / Pytorch-Simple-TransformerLinks
A simple transformer implementation without difficult syntax and extra bells and whistles.
☆53Updated 2 years ago
Alternatives and similar repositories for Pytorch-Simple-Transformer
Users that are interested in Pytorch-Simple-Transformer are comparing it to the libraries listed below
Sorting:
- Large dataset storage format for Pytorch☆45Updated 3 years ago
- AdaCat☆49Updated 3 years ago
- Simple dataset to dataloader library for pytorch☆33Updated 7 months ago
- An Implementation of Transformer in Transformer in TensorFlow for image classification, attention inside local patches☆43Updated 3 years ago
- A generative modelling toolkit for PyTorch.☆70Updated 3 years ago
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL.☆40Updated 3 months ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Updated 3 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Updated 3 years ago
- ☆39Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- A GPT, made only of MLPs, in Jax☆58Updated 4 years ago
- Check if you have training samples in your test set☆64Updated 3 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- a lightweight transformer library for PyTorch☆72Updated 3 years ago
- ☆18Updated 3 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated 2 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Updated 3 years ago
- A stateful pytree library for training neural networks.☆22Updated 3 years ago
- Aloception is a set of package for computer vision: aloscene, alodataset, alonet.☆93Updated 2 months ago
- Functional deep learning☆108Updated 2 years ago
- ☆30Updated 5 years ago
- Visualize tensors in a plain Python REPL using Sparklines☆45Updated 4 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 3 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 5 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- Module 0 - Fundamentals☆105Updated 11 months ago
- Run compute jobs on AWS as if you were running them locally.☆125Updated 3 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep R…☆173Updated last month
- ☆20Updated 4 years ago