karpathy / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
β20,810Updated 5 months ago
Alternatives and similar repositories for minGPT:
Users that are interested in minGPT are comparing it to the libraries listed below
- The simplest, fastest repository for training/finetuning medium-sized GPTs.β38,486Updated last month
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β16,978Updated this week
- Ongoing research training transformer models at scaleβ11,109Updated this week
- π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.β137,641Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ20,584Updated last week
- Fast and memory-efficient exact attentionβ15,064Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"β11,119Updated last month
- Code for the paper "Language Models are Unsupervised Multitask Learners"β22,792Updated 5 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β36,255Updated this week
- Train transformer language models with reinforcement learning.β10,609Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β30,809Updated last week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β8,178Updated this week
- Development repository for the Triton language and compilerβ14,042Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation.β10,479Updated last month
- Inference Llama 2 in one file of pure Cβ17,858Updated 5 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)β7,075Updated last year
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)β8,655Updated this week
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like APIβ10,914Updated 5 months ago
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β35,268Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ26,971Updated 5 months ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ30,985Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.β11,197Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ33,809Updated this week
- Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.β28,807Updated last week
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,168Updated 7 months ago
- A library for efficient similarity search and clustering of dense vectors.β32,387Updated this week
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)β¦β13,005Updated last week
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterβ¦β13,831Updated 5 months ago
- Trax β Deep Learning with Clear Code and Speedβ8,136Updated this week
- The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.β10,056Updated last week