saqib1707 / gpt2-from-scratchLinks
PyTorch Implementation of GPT-2
☆14Updated 10 months ago
Alternatives and similar repositories for gpt2-from-scratch
Users that are interested in gpt2-from-scratch are comparing it to the libraries listed below
Sorting:
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated last year
- Recover Wi-Fi Password Using CMD, Windows PowerShell☆17Updated 2 years ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆111Updated 2 years ago
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.☆43Updated last year
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆295Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch☆338Updated last year
- Quantization of LLMs and benchmarking.☆10Updated last year
- The auto mechanic finder is a web-based application. This is a platform which will allow customers to find the auto-mechanic from differe…☆9Updated last year
- Simple multi-language .Net Core CMS☆10Updated 6 years ago
- Personal GitHub profile showcasing AI, machine learning, and software development expertise.☆10Updated 3 weeks ago
- Reweight GPT - a simple neural network using transformer architecture for next character prediction☆56Updated last year
- ☆65Updated this week
- Slides for "Retrieval Augmented Generation" video☆20Updated last year
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆134Updated 8 months ago
- This is a PyTorch-based implementation of the Generative Adversarial Text-to-Image Synthesis paper, utilizing a GAN architecture inspired…☆20Updated 2 years ago
- AOs for every system! A middle-ground between Windows Command Prompt and PowerShell.☆10Updated 7 months ago
- GPU Kernels☆191Updated 2 months ago
- Testing KAN-based text generation GPT models☆18Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆315Updated this week
- ☆43Updated 2 months ago
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆120Updated 6 months ago
- A command-line utility for generating language-specific project structure.☆16Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆188Updated last month
- Experimenting with small language models☆68Updated last year
- The training notebooks that were similar to the original script used to train TinyMistral.☆22Updated last year
- The Tensor (or Array)☆438Updated 11 months ago
- a simplified version of Meta's Llama 3 model to be used for learning☆41Updated last year
- A discord library for FreakC (based on discord.bat)☆7Updated 3 years ago
- ☆184Updated 7 months ago
- Tutorial for how to build BERT from scratch☆96Updated last year