feather-ai / transformers-tutorialLinks
The code for the video tutorial series on building a Transformer from scratch: https://www.youtube.com/watch?v=XR4VDnJzB8o
☆19Updated 2 years ago
Alternatives and similar repositories for transformers-tutorial
Users that are interested in transformers-tutorial are comparing it to the libraries listed below
Sorting:
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆59Updated 3 years ago
- Simple illustrative examples for energy-based models in PyTorch☆67Updated 5 years ago
- All about the fundamentals and working of Diffusion Models☆159Updated 2 years ago
- NLP Examples using the 🤗 libraries☆40Updated 4 years ago
- ☆20Updated 3 years ago
- ☆75Updated 3 years ago
- ☆24Updated 3 years ago
- The Forward-Forward Algorithm for Drug Discovery☆34Updated 2 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Introductory lecture on Pytorch☆17Updated 3 years ago
- A tour of different optimization algorithms in PyTorch.☆99Updated 3 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Updated 5 years ago
- Computer Vision and Pattern Recognition, NUS CS4243, 2022☆177Updated 3 years ago
- ML Research paper summaries, annotated papers and implementation walkthroughs☆113Updated 3 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 3 years ago
- This repository provides a Colab Notebook that shows how to use Spatial Transformer Networks inside CNNs in Keras.☆37Updated 3 years ago
- Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2.☆61Updated 4 years ago
- ☆15Updated 3 years ago
- notebooks of cool EBM visualizations☆15Updated 4 years ago
- All about the fundamental blocks of TF and JAX!☆275Updated 3 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi …☆66Updated 5 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 3 years ago
- Basic guidance on how to contribute to Papers with Code☆24Updated 3 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Example unit tests for deep learning projects.☆37Updated 5 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Updated 3 years ago
- ☆134Updated 2 years ago
- Pytorch implementation of bistable recurrent cell with baseline comparisons.☆25Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆38Updated 3 years ago
- https://slds-lmu.github.io/seminar_multimodal_dl/☆171Updated 2 years ago