clabrugere / scratch-llm
Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.
☆33Updated last week
Alternatives and similar repositories for scratch-llm:
Users that are interested in scratch-llm are comparing it to the libraries listed below
- Tiny C++11 GPT-2 inference implementation from scratch☆55Updated last month
- An assignment for building an NLP system from scratch.☆24Updated 11 months ago
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆60Updated last year
- PromptCraft is a prompt perturbation toolkit from the character, word, and sentence levels for prompt robustness analysis. PyPI Package: …☆14Updated last year
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- setup the env for vllm users☆16Updated last year
- Using ChatGPT to select interesting arXiv papers☆13Updated 2 weeks ago
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated 9 months ago
- ☆18Updated last year
- Benchmarking PyTorch 2.0 different models☆21Updated last year
- Inference Llama 2 in C++☆45Updated 9 months ago
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆20Updated last week
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆141Updated 10 months ago
- nanogpt turned into a chat model☆65Updated last year
- This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation.☆48Updated last year
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated last year
- ☆17Updated last year
- ☆30Updated 2 years ago
- Manages vllm-nccl dependency☆17Updated 8 months ago
- ☆36Updated last year
- finetuning shakespeare on karpathy/nanoGPT☆17Updated 2 years ago
- Inference Llama 2 in one file of pure C++☆81Updated last year
- Playground for Transformers☆48Updated last year
- Training a BERT model from scratch.☆10Updated last year
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- ⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems☆19Updated last year
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 2 months ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Updated 2 years ago
- ML/DL Math and Method notes☆58Updated last year
- ☆45Updated 3 months ago