bashnick / transformer
A codebase implementing a simple GPT-like model from scratch, based on the "Attention Is All You Need" paper.
☆71 · Updated 2 years ago
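The core building block of a GPT-like model such as this one is the scaled dot-product attention from the "Attention Is All You Need" paper. Below is a minimal NumPy sketch of causal self-attention; it is illustrative only and not taken from the repository (function and variable names are made up):

```python
import numpy as np

def scaled_dot_product_attention(q, k, v):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)               # (seq, seq) similarities
    # Causal mask: each position may attend only to itself and earlier tokens
    mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
    scores = np.where(mask, -np.inf, scores)
    # Numerically stable softmax over the last axis
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v                            # weighted mix of values

# Toy input: 4 tokens, model dimension 8 (self-attention: Q = K = V)
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
out = scaled_dot_product_attention(x, x, x)
```

Because of the causal mask, the first token can only attend to itself, so its output equals its input value vector; later tokens mix information from all preceding positions.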
Alternatives and similar repositories for transformer
Users interested in transformer are comparing it to the repositories listed below.
- nanoGPT turned into a chat model · ☆80 · Updated 2 years ago
- Reweight GPT: a simple neural network using the transformer architecture for next-character prediction · ☆57 · Updated 2 years ago
- Training and fine-tuning an LLM in Python and PyTorch · ☆43 · Updated 2 years ago
- Code from a practical deep dive into using Mamba for information extraction · ☆57 · Updated 2 years ago
- RWKV in nanoGPT style · ☆197 · Updated last year
- Code base for internal reward models and PPO training · ☆24 · Updated 2 years ago
- ☆148 · Updated last year
- ☆41 · Updated last year
- Tools and scripts for experimenting with Transformers: BERT, T5... · ☆61 · Updated 2 years ago
- Micro Llama: a small Llama-based model with 300M parameters, trained from scratch on a $500 budget · ☆169 · Updated 5 months ago
- ☆86 · Updated 2 years ago
- Create an AI capable of solving reasoning tasks it has never seen before · ☆96 · Updated last year
- Inference of RWKV v7 in pure C · ☆43 · Updated 3 months ago
- A minimal example of aligning language models with RLHF, similar to ChatGPT · ☆225 · Updated 2 years ago
- LLaMA 3 is one of the most promising open-source models after Mistral; this repo recreates its architecture in a simpler manner · ☆197 · Updated last year
- ☆65 · Updated 2 years ago
- GPT-2 small trained on phi-like data · ☆68 · Updated last year
- Experimenting with small language models · ☆76 · Updated 2 years ago
- A Very Simple Vector Database · ☆15 · Updated 2 years ago
- Lightweight demos for fine-tuning LLMs, powered by 🤗 transformers and open-source datasets · ☆77 · Updated last year
- Port of Andrej Karpathy's nanoGPT to the Apple MLX framework · ☆117 · Updated last year
- Pre-training code for the CrystalCoder 7B LLM · ☆57 · Updated last year
- Evaluation of the BM42 sparse indexing algorithm · ☆72 · Updated last year
- Inference of Mamba and Mamba2 models in pure C · ☆196 · Updated last week
- Minimal code to train a Large Language Model (LLM) · ☆170 · Updated 3 years ago
- ☆31 · Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs · ☆41 · Updated last year
- A Google Colab notebook for fine-tuning Alpaca-LoRA (within 3 hours on a 40 GB A100 GPU) · ☆38 · Updated 2 years ago
- Train your own small BitNet model · ☆77 · Updated last year
- A really tiny autograd engine · ☆99 · Updated 8 months ago