Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
☆347May 28, 2023Updated 2 years ago
Alternatives and similar repositories for transformer-from-scratch-notes
Users that are interested in transformer-from-scratch-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Attention is all you need implementation☆1,191Jun 8, 2024Updated last year
- BERT explained from scratch☆17Oct 26, 2023Updated 2 years ago
- Notes about LLaMA 2 model☆73Aug 30, 2023Updated 2 years ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆127Jul 24, 2023Updated 2 years ago
- Stable Diffusion implemented from scratch in PyTorch☆1,045Oct 22, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆181Jan 7, 2024Updated 2 years ago
- Everything related to the reading group.☆10Oct 29, 2025Updated 5 months ago
- The repository includes detailed steps to get data from GES DISC, convert HDF5 files to CSV and plotting geographic data.☆11Aug 17, 2020Updated 5 years ago
- Slides for "Retrieval Augmented Generation" video☆26Nov 27, 2023Updated 2 years ago
- mathematica (miscellaneous)☆19Nov 1, 2025Updated 5 months ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw☆601Dec 6, 2024Updated last year
- Recent papers on Graph Neural Networks-based Recommender System.☆12Aug 21, 2023Updated 2 years ago
- ☆12Jan 16, 2022Updated 4 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆13Jun 19, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Oct 5, 2025Updated 6 months ago
- An Interfernce RAG-based LLM Pipeline with Best Practice LLMOps☆13Aug 20, 2024Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆23Feb 6, 2025Updated last year
- ☆423Apr 10, 2025Updated 11 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆63Apr 14, 2024Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Apr 21, 2023Updated 2 years ago
- Sophgo AI chips driver and runtime library.☆24Updated this week
- Code Transformer neural network components piece by piece☆380May 1, 2023Updated 2 years ago
- Use AI to send verbal nudges to Alzheimer's patients with anterograde amnesia.☆12Oct 15, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Open sourced result for The Agent Company☆21Nov 11, 2025Updated 4 months ago
- GenAI Examples☆16Dec 13, 2024Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆79Jan 28, 2024Updated 2 years ago
- Rethinking materials simulations: Blending DNS with Neural Operators☆22Jul 9, 2024Updated last year
- ☆109Dec 10, 2025Updated 3 months ago
- ☆23Jun 12, 2023Updated 2 years ago
- Implement different variants of gradient descent in python using numpy☆11Apr 23, 2019Updated 6 years ago
- ☆20Jul 3, 2024Updated last year
- 😱 This Python script provides a comprehensive analysis of stock options using data retrieved from Yahoo Finance. It calculates various m…☆15Mar 1, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Training LLMs with QLoRA + FSDP☆1,538Nov 9, 2024Updated last year
- ☆12Dec 3, 2020Updated 5 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆55,825Nov 12, 2025Updated 4 months ago
- ☆4,616Jan 31, 2024Updated 2 years ago
- Multi-factor Risk Models of Asset or Portfolio Returns☆10May 4, 2021Updated 4 years ago
- 📚 LaTeX templates and tools for creating beautiful, structured documents 📝☆14Oct 24, 2025Updated 5 months ago
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆287Mar 27, 2026Updated last week