clabrugere / scratch-llmLinks

Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.

☆37

Alternatives and similar repositories for scratch-llm

Users that are interested in scratch-llm are comparing it to the libraries listed below

Sorting:

Hemanthkumar2112 / Reward-Modeling-RLHF-Finetune-and-RAG
Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform
☆22Updated 5 months ago
tensorchord / inference-benchmark
Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)
☆28Updated 2 years ago
sahibpreetsingh12 / llm-learning
☆14Updated last year
kailums / flash-attention-rocm
Fast and memory-efficient exact attention ported to rocm
☆11Updated last year
geronimi73 / 3090_shorts
minimal scripts for 24GB VRAM GPUs. training, inference, whatever
☆41Updated last month
rasbt / pytorch-memory-optim
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…
☆92Updated 2 years ago
lucasjinreal / wnnx_models
Various test models in WNNX format. It can view with `pip install wnetron && wnetron`
☆12Updated 3 years ago
vllm-project / vllm-nccl
Manages vllm-nccl dependency
☆17Updated last year
shrimantasatpati / Microsoft-Phi-2-Streamlit
Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…
☆14Updated last year
Montinger / Transformer-Workbench
Playground for Transformers
☆51Updated last year
Michaelvll / llm-ie-benchmarks
A collection of reproducible inference engine benchmarks
☆32Updated 3 months ago
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24Updated 3 weeks ago
UmerHA / triton_util
Make triton easier
☆47Updated last year
knotgrass / attention
several types of attention modules written in PyTorch for learning purposes
☆53Updated 9 months ago
FrancescoSaverioZuppichini / pytorch-2.0-benchmark
Benchmarking PyTorch 2.0 different models
☆21Updated 2 years ago
facebookresearch / tce
Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.
☆13Updated last year
eth-easl / fmengine
Utilities for Training Very Large Models
☆58Updated 9 months ago
tigerchen52 / awesome_role_of_small_models
a curated list of the role of small models in the LLM era
☆102Updated 10 months ago
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆47Updated last year
paperswithcode / model-index
Create a source of truth for ML model results and browse it on Papers with Code
☆32Updated 4 years ago
leloykun / llama2.cpp
Inference Llama 2 in one file of pure C++
☆83Updated last year
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆55Updated 3 weeks ago
GreenBitAI / low_bit_llama
Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs
☆110Updated last year
tcapelle / mixtral
Mixtral finetuning
☆19Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
zaydzuhri / pythia-mlkv
Multi-Layer Key-Value sharing experiments on Pythia models
☆33Updated last year
facebookresearch / coocmap
code for paper "Accessing higher dimensions for unsupervised word translation"
☆21Updated 2 years ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated 2 months ago
MILVLG / mlc-imp
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
☆10Updated last year
facebookresearch / lss_eval
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Updated last year