Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
☆341 · May 28, 2023 · Updated 2 years ago
Alternatives and similar repositories for transformer-from-scratch-notes
Users that are interested in transformer-from-scratch-notes are comparing it to the libraries listed below
- Attention is all you need implementation ☆1,176 · Jun 8, 2024 · Updated last year
- BERT explained from scratch ☆16 · Oct 26, 2023 · Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch ☆365 · Sep 25, 2023 · Updated 2 years ago
- ☆13 · Oct 5, 2025 · Updated 4 months ago
- Recent papers on Graph Neural Networks-based Recommender System. ☆12 · Aug 21, 2023 · Updated 2 years ago
- Stable Diffusion implemented from scratch in PyTorch ☆1,030 · Oct 22, 2024 · Updated last year
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch. ☆12 · Jun 19, 2017 · Updated 8 years ago
- The official baseline implementations for Chronocept ☆10 · Dec 21, 2025 · Updated 2 months ago
- Gaze estimation from 2D image ☆13 · Dec 17, 2024 · Updated last year
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models" ☆24 · Mar 4, 2025 · Updated last year
- ☆15 · Feb 23, 2026 · Updated last week
- Implementation of the paper "Denoising Diffusion Probabilistic Models" in PyTorch ☆67 · Jul 4, 2023 · Updated 2 years ago
- Open sourced result for The Agent Company ☆21 · Nov 11, 2025 · Updated 3 months ago
- GenAI Examples ☆16 · Dec 13, 2024 · Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 · Apr 21, 2023 · Updated 2 years ago
- Cross-modal Hierarchical Modelling for FGSBIR. Work accepted for Oral presentation in BMVC 2020 ☆18 · Sep 8, 2023 · Updated 2 years ago
- Transcribe and summarize videos using Whisper and LLMs on Apple's MLX framework ☆79 · Jan 28, 2024 · Updated 2 years ago
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw ☆593 · Dec 6, 2024 · Updated last year
- HAM ☆18 · Sep 19, 2021 · Updated 4 years ago
- Deep Learning Implementations ☆17 · Dec 24, 2020 · Updated 5 years ago
- Finetuning BLOOM on a single GPU using gradient-accumulation ☆31 · Mar 29, 2023 · Updated 2 years ago
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries ☆36 · Jul 6, 2025 · Updated 7 months ago
- "Graph Convolutions Enrich the Self-Attention in Transformers!" NeurIPS 2024 ☆27 · Mar 19, 2025 · Updated 11 months ago
- Slides for "Retrieval Augmented Generation" video ☆24 · Nov 27, 2023 · Updated 2 years ago
- ☆29 · Nov 9, 2025 · Updated 3 months ago
- ☆417 · Apr 10, 2025 · Updated 10 months ago
- PyTorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model. ☆23 · Mar 14, 2019 · Updated 6 years ago
- UCAS (University of Chinese Academy of Sciences) Fall 2023 Pattern Recognition and Machine Learning notes ☆31 · Oct 2, 2024 · Updated last year
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video ☆63 · Apr 14, 2024 · Updated last year
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings ☆12 · Jun 28, 2022 · Updated 3 years ago
- Notes and commented code for RLHF (PPO) ☆126 · Feb 27, 2024 · Updated 2 years ago
- Course Materials for Practical Data Analysis with Python and SQL ☆34 · Aug 4, 2024 · Updated last year
- An easy, reliable, fluid template for Python packages complete with docs, testing suites, READMEs, GitHub workflows, linting and much muc… ☆200 · Jan 30, 2026 · Updated last month
- ☆294 · Feb 24, 2026 · Updated last week
- C/C++ Algorithms Implementation for Code In ☆14 · Nov 15, 2015 · Updated 10 years ago
- ☆10 · Mar 18, 2024 · Updated last year
- This repository contains an implementation of the 3D watermarking algorithm proposed by Cayre et al., based on spectral decomposition. ☆11 · Jun 3, 2018 · Updated 7 years ago
- An n-body simulation of our solar system written in Python ☆11 · Dec 6, 2021 · Updated 4 years ago
- Node project to collect Posts, Likes, Comments, Follows and Following stats from Instagram profiles without signing up for their API ☆12 · Mar 25, 2024 · Updated last year