AkiRusProd / numpy-transformer
A numpy implementation of the Transformer model in "Attention is All You Need"
☆50Updated 4 months ago
Related projects
Alternatives and complementary repositories for numpy-transformer
- Custom torch-style machine learning framework with automatic differentiation, implemented in numpy; allows building GANs, VAEs, etc.☆66Updated 3 months ago
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆54Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆142Updated 5 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆48Updated 7 months ago
- LoRA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆82Updated last year
- Simple Adaptation of BitNet☆29Updated 7 months ago
- ☆68Updated 8 months ago
- LLaMA 2 implemented from scratch in PyTorch☆258Updated last year
- Examples of using PyTorch hooks, as covered in my YouTube tutorial video.☆32Updated last year
- A repository for log-time feedforward networks☆216Updated 7 months ago
- Notes on quantization in neural networks☆58Updated 11 months ago
- ML/DL Math and Method notes☆57Updated 11 months ago
- ☆13Updated last year
- ☆122Updated 4 months ago
- Prune transformer layers☆65Updated 5 months ago
- PyTorch implementation of the sparse attention from the paper "Generating Long Sequences with Sparse Transformers"☆60Updated last week
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆36Updated 2 months ago
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆10Updated 2 years ago
- Annotated version of the Mamba paper☆457Updated 8 months ago
- ☆44Updated 11 months ago
- LoRA and DoRA from Scratch Implementations☆188Updated 8 months ago
- Cataloging released Triton kernels.☆138Updated 2 months ago
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow☆54Updated last year
- ☆132Updated last year
- Implement Transformers (and Deep Learning) from scratch in NumPy☆23Updated last year
- I will build Transformer from scratch☆50Updated 6 months ago
- This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.☆149Updated 2 years ago
- NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆61Updated last month
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆143Updated last week
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆93Updated last month