clabrugere / scratch-llm
Implements an LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.
☆37Updated 8 months ago
Alternatives and similar repositories for scratch-llm
Users who are interested in scratch-llm are comparing it to the libraries listed below.
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆22Updated 8 months ago
- Manages vllm-nccl dependency☆17Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆43Updated last week
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- a curated list of the role of small models in the LLM era☆107Updated last year
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆91Updated 2 years ago
- Make triton easier☆48Updated last year
- ☆17Updated last year
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Updated last year
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated last year
- ☆14Updated 2 years ago
- Deploy DL/ ML inference pipelines with minimal extra code.☆100Updated 11 months ago
- Utilities for Training Very Large Models☆58Updated last year
- A collection of reproducible inference engine benchmarks☆37Updated 6 months ago
- ☆21Updated 3 years ago
- A memory efficient DLRM training solution using ColossalAI☆106Updated 2 years ago
- Visualising Losses in Deep Neural Networks☆16Updated last year
- benchmarking some transformer deployments☆26Updated 2 years ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree of classifier calibration.☆13Updated 2 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆82Updated 2 years ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆56Updated 2 weeks ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.☆21Updated 4 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Updated 2 years ago
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding. Published as "Tree-Based Represent…☆40Updated 2 years ago
- ML/DL Math and Method notes☆64Updated last year
- implementing various transformer models for various tasks☆66Updated 3 years ago