hkproj / transformer-from-scratch-notes

Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)

☆295

Alternatives and similar repositories for transformer-from-scratch-notes

Users who are interested in transformer-from-scratch-notes are comparing it to the libraries listed below.

hkproj / pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
☆111 Updated last year
ajhalthor / Transformer-Neural-Network
Code Transformer neural network components piece by piece
☆356 Updated 2 years ago
hkproj / pytorch-transformer
Attention is all you need implementation
☆978 Updated last year
MK2112 / nn-zero-to-hero-notes
Jupyter Notebook notes on Andrej Karpathy's videos and the tutorial series, "Neural Networks: Zero to Hero."
☆176 Updated this week
hkproj / pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
☆338 Updated last year
1y33 / 100Days
GPU Kernels
☆190 Updated 2 months ago
Denis2054 / Transformers-for-NLP-and-Computer-Vision-3rd-Edition
Transformers 3rd Edition
☆432 Updated 2 months ago
hkproj / 100-days-of-gpu
☆350 Updated 3 months ago
hkproj / mistral-src-commented
Reference implementation of Mistral AI 7B v0.1 model.
☆28 Updated last year
StatQuest / decoder_transformer_from_scratch
☆134 Updated last year
rasbt / pycon2024
Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024
☆246 Updated last year
AayushSameerShah / Neural-Net-Zero-to-Hero-with-Andrej
This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried …
☆113 Updated last year
rohan-paul / LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) FineTuning
☆546 Updated 3 months ago
markhliu / DGAI
Learn Generative AI with PyTorch (Manning Publications, 2024)
☆112 Updated last month
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆315 Updated this week
0ssamaak0 / Karpathy-Neural-Networks-Zero-to-Hero
☆122 Updated 5 months ago
arj7192 / MasteringPyTorchV2
☆145 Updated last year
genaibook / genaibook
Contains the public resources of Hands on GenAI book
☆177 Updated 6 months ago
springer-llms-deep-dive / llms-deep-dive-tutorials
☆125 Updated 10 months ago
a-hamdi / GPU
100 days of building GPU kernels!
☆462 Updated 2 months ago
FareedKhan-dev / create-million-parameter-llm-from-scratch
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆180 Updated last year
Open-Deep-ML / DML-OpenProblem
☆461 Updated last week
Lightning-AI / dl-fundamentals
Deep Learning Fundamentals -- Code material and exercises
☆386 Updated last year
hkproj / quantization-notes
Notes on quantization in neural networks
☆90 Updated last year
FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
☆172 Updated 10 months ago
LukeDitria / pytorch_tutorials
Beginner Level Deep Learning Tutorials in Pytorch with Youtube Videos!
☆379 Updated 7 months ago
jeffheaton / app_deep_learning
T81-558: PyTorch - Applications of Deep Neural Networks @Washington University in St. Louis
☆451 Updated 3 weeks ago
PacktPublishing / Modern-Computer-Vision-with-PyTorch-2E
Modern Computer Vision with PyTorch, 2E, Published by Packt
☆214 Updated last month
hkproj / pytorch-paligemma
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
☆504 Updated 7 months ago
karpathy / transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆131 Updated 3 years ago