clabrugere / scratch-llm
Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.
☆29Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for scratch-llm
- Tiny C++11 GPT-2 inference implementation from scratch☆46Updated 10 months ago
- Manages vllm-nccl dependency☆17Updated 5 months ago
- The fastai book, 2nd edition (in progress)☆47Updated 4 months ago
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆36Updated 3 months ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆100Updated 2 months ago
- ☆17Updated this week
- Reward Model framework for LLM RLHF☆58Updated last year
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆13Updated 6 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆111Updated 6 months ago
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆32Updated last month
- Training a BERT model from scratch.☆10Updated last year
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆122Updated 7 months ago
- ☆35Updated last year
- 7 query strategies for navigating knowledge graphs with LlamaIndex☆40Updated last year
- ☆18Updated 8 months ago
- ☆31Updated 3 weeks ago
- Having fun with ML☆11Updated 7 months ago
- Training and Fine-tuning an llm in Python and PyTorch.☆41Updated last year
- Official repository for RAGVIZ: Diagnose and Visualize Retrieval-Augmented Generation☆21Updated this week
- Benchmarking PyTorch 2.0 different models☆21Updated last year
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆22Updated this week
- ☆44Updated 2 months ago
- Learn Generative AI with PyTorch (Manning Publications, 2024)☆47Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆51Updated last month
- ☆22Updated 3 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆54Updated last year
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆33Updated 2 years ago
- aigc evals☆10Updated 11 months ago
- ☆13Updated last year