karpathy / minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆19,845Updated last month
Related projects: ⓘ
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆36,216Updated last month
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆19,545Updated 3 weeks ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆30,165Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆34,719Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆32,206Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆29,930Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.☆25,168Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆29,346Updated 2 months ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆132,113Updated this week
- Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.☆27,963Updated this week
- Inference code for Llama models☆55,482Updated last month
- Inference Llama 2 in one file of pure C☆17,153Updated last month
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆24,723Updated last month
- Code for the paper "Language Models are Unsupervised Multitask Learners"☆22,256Updated last month
- Google Research☆33,807Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆15,839Updated this week
- Ongoing research training transformer models at scale☆9,949Updated this week
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆10,011Updated last month
- Awesome-LLM: a curated list of Large Language Model☆17,413Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆30,511Updated this week
- Making large AI models cheaper, faster and more accessible☆38,614Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆36,446Updated this week
- Fast and memory-efficient exact attention☆13,401Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆26,822Updated this week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆12,397Updated 2 weeks ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆11,796Updated last month
- Development repository for the Triton language and compiler☆12,698Updated this week
- Instruct-tune LLaMA on consumer hardware☆18,537Updated last month
- LlamaIndex is a data framework for your LLM applications☆35,450Updated this week
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆26,143Updated this week