hkproj / transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
☆270Updated last year
Alternatives and similar repositories for transformer-from-scratch-notes:
Users that are interested in transformer-from-scratch-notes are comparing it to the libraries listed below
- Attention is all you need implementation☆911Updated 10 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆102Updated last year
- LLaMA 2 implemented from scratch in PyTorch☆322Updated last year
- GPU Kernels☆172Updated last week
- Transformers 3rd Edition☆412Updated last week
- Code Transformer neural network components piece by piece☆342Updated 2 years ago
- Distributed training (multi-node) of a Transformer model☆65Updated last year
- Jupyter Notebook notes on Andrej Karpathy's videos and the tutorial series, "Neural Networks: Zero to Hero."☆162Updated this week
- I will build Transformer from scratch☆68Updated 11 months ago
- Notes about LLaMA 2 model☆59Updated last year
- ☆159Updated 4 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 11 months ago
- 100 days of building GPU kernels!☆399Updated last week
- Notes on quantization in neural networks☆80Updated last year
- This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried …☆109Updated last year
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆243Updated last year
- Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Sc…☆114Updated last week
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated last year
- ☆247Updated 3 months ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024☆244Updated 11 months ago
- repo of paper implementations☆19Updated 2 months ago
- LoRA and DoRA from Scratch Implementations☆203Updated last year
- Learn Generative AI with PyTorch (Manning Publications, 2024)☆92Updated 5 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆104Updated 3 months ago
- Alex Krizhevsky's original code from Google Code☆191Updated 9 years ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆158Updated 11 months ago
- ☆389Updated 2 weeks ago
- Machine Learning Q and AI book☆416Updated 7 months ago
- Repo for ML Models built from scratch such as Self-Attention, Linear +Logistic Regression, PCA, LDA. CNN, LSTM, Neural Networks using Nu…☆47Updated 3 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆459Updated 5 months ago