clabrugere / scratch-llmLinks
Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.
☆37Updated 10 months ago
Alternatives and similar repositories for scratch-llm
Users that are interested in scratch-llm are comparing it to the libraries listed below
Sorting:
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- Manages vllm-nccl dependency☆17Updated last year
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆23Updated 10 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated 2 years ago
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Updated 2 years ago
- ☆17Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- PyTorch implementation of moe, which stands for mixture of experts☆51Updated 4 years ago
- Playground for Transformers☆53Updated 2 years ago
- implementing various transformer models for various tasks☆67Updated 3 years ago
- Fast and memory-efficient exact attention ported to rocm☆12Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last week
- several types of attention modules written in PyTorch for learning purposes☆52Updated last week
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 2 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated 2 years ago
- Implementation of transformers based architecture in PyTorch.☆55Updated 5 years ago
- ☆59Updated 2 years ago
- ☆15Updated last year
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Updated last year
- A collection of reproducible inference engine benchmarks☆38Updated 8 months ago
- ☆21Updated 3 years ago
- Utilities for Training Very Large Models☆58Updated last year
- BCQ tutorial for transformers☆18Updated 2 years ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆50Updated last month
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆58Updated this week
- ML/DL Math and Method notes☆65Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.☆71Updated 2 years ago
- Create a source of truth for ML model results and browse it on Papers with Code☆34Updated 4 years ago