IpsumDominum / Pytorch-Simple-Transformer
A simple transformer implementation without difficult syntax and extra bells and whistles.
☆43 Updated last year
Related projects
Alternatives and complementary repositories for Pytorch-Simple-Transformer
- ☆18 Updated 2 years ago
- AdaCat ☆49 Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆30 Updated last year
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast ☆16 Updated 2 years ago
- Visualize tensors in a plain Python REPL using Sparklines ☆45 Updated 3 years ago
- ☆30 Updated 4 years ago
- Large dataset storage format for Pytorch ☆45 Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 Updated 2 years ago
- Cellular automaton-based calculus for the masses ☆24 Updated 6 years ago
- Adaptation of discriminative learning rates from the Fastai library for standard PyTorch. ☆18 Updated 5 years ago
- A framework for implementing equivariant DL ☆10 Updated 3 years ago
- Describe the format of image/text datasets ☆11 Updated 2 years ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆20 Updated 4 years ago
- A GPT, made only of MLPs, in Jax ☆55 Updated 3 years ago
- Write your code as tree-like expressions, then transform it ☆21 Updated 10 months ago
- ☆18 Updated last year
- The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp ☆19 Updated 4 years ago
- A generative modelling toolkit for PyTorch. ☆70 Updated 3 years ago
- A stateful pytree library for training neural networks. ☆21 Updated 2 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ☆13 Updated 2 years ago
- Simplifying parsing of large jsonline files in NLP Workflows ☆12 Updated 2 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow. ☆15 Updated 3 years ago
- A dashboard for exploring timm learning rate schedulers ☆18 Updated last year
- An open source implementation of CLIP. ☆32 Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆47 Updated 2 years ago
- A JAX nn library ☆21 Updated 8 months ago