hbchen-one / Transformer-Models-from-Scratch
implementing various transformer models for various tasks
☆62Updated 2 years ago
Alternatives and similar repositories for Transformer-Models-from-Scratch:
Users that are interested in Transformer-Models-from-Scratch are comparing it to the libraries listed below
- code for the ddp tutorial☆32Updated 2 years ago
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy☆127Updated last year
- Playground for Transformers☆47Updated last year
- several types of attention modules written in PyTorch for learning purposes☆45Updated 4 months ago
- ☆89Updated 2 years ago
- ☆167Updated 2 years ago
- Accelerate Model Training with PyTorch 2.X, published by Packt☆36Updated 8 months ago
- TensorFlow 2.* exercises from the book "Deep Learning with Python" by François Chollet☆45Updated 3 years ago
- ☆130Updated last year
- Distributed Machine Learning with Python, published by Packt☆39Updated last year
- Tutorial for how to build BERT from scratch☆87Updated 8 months ago
- Implementation of transformers based architecture in PyTorch.☆54Updated 4 years ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆89Updated 3 weeks ago
- Code implementation from my blog post: https://fkodom.substack.com/p/transformers-from-scratch-in-pytorch☆92Updated last year
- ☆78Updated last year
- ☆13Updated 2 years ago
- PDFs and Codelabs for the Efficient Deep Learning book.☆191Updated last year
- A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gra…☆191Updated last year
- Efficient Deep Learning Survey Paper☆33Updated 2 years ago
- MLPNAS code for Paperspace series on Neural Architecture Search☆22Updated last year
- ☆18Updated 2 years ago
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)☆16Updated last year
- ☆28Updated last year
- Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst…☆17Updated last year
- [AAAI 2022] RareGAN: Generating Samples for Rare Classes☆22Updated 2 years ago
- Flexible Python library providing building blocks (layers) for reproducible Transformers research (Tensorflow ✅, Pytorch 🔜, and Jax 🔜)☆53Updated last year
- https://slds-lmu.github.io/seminar_multimodal_dl/☆167Updated 2 years ago
- Examples of using PyTorch hooks, as covered in my YouTube tutorial video.☆33Updated last year
- ☆72Updated 3 years ago