feather-ai / transformers-tutorial
The code for the video tutorial series on building a Transformer from scratch: https://www.youtube.com/watch?v=XR4VDnJzB8o
☆18Updated last year
Alternatives and similar repositories for transformers-tutorial:
Users that are interested in transformers-tutorial are comparing it to the libraries listed below
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- ☆24Updated 2 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆24Updated 2 years ago
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Updated 4 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆48Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- Sequence models in Numpy☆25Updated 4 years ago
- NLP Examples using the 🤗 libraries☆41Updated 3 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆33Updated 4 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.☆12Updated last year
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 2 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10Updated last year
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆31Updated 2 years ago
- ☆73Updated 2 years ago
- A simple Transformer where the softmax has been replaced with normalization☆19Updated 4 years ago
- Basic guidance on how to contribute to Papers with Code☆20Updated 2 years ago
- notebooks of cool EBM visualizations☆15Updated 4 years ago
- Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFace☆41Updated 3 months ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆32Updated 2 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆72Updated 2 years ago
- A tour of different optimization algorithms in PyTorch.☆95Updated 3 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Updated 2 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Updated 3 years ago
- Cyclemoid implementation for PyTorch☆87Updated 2 years ago