clabrugere / scratch-llm
Implements an LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.
☆32Updated 8 months ago
Alternatives and similar repositories for scratch-llm:
Users who are interested in scratch-llm are comparing it to the libraries listed below.
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated 8 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated last month
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆37Updated 2 years ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 2 years ago
- Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch☆40Updated last year
- Scripts for text classification with llama and bert☆9Updated last month
- ☆21Updated 3 years ago
- Benchmarking different models with PyTorch 2.0☆21Updated last year
- A gzip-based text-classification system.☆32Updated last year
- Minimal LLM scripts for 24GB VRAM GPUs: training, inference, whatever☆35Updated this week
- Mixtral finetuning☆19Updated 11 months ago
- LLaMA implementation for HuggingFace Transformers☆38Updated last year
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆33Updated last year
- Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform☆20Updated 6 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆27Updated last year
- ☆14Updated 7 months ago
- Reward Model framework for LLM RLHF☆58Updated last year
- Inference Llama 2 in one file of pure C++☆81Updated last year
- Transforming textual descriptions into process models using deep learning☆13Updated 5 years ago
- ☆17Updated last year
- Several types of attention modules written in PyTorch for learning purposes☆43Updated 3 months ago
- Finetuning BLOOM on a single GPU using gradient-accumulation☆27Updated last year
- ☆18Updated 11 months ago
- Fine-tuning Mistral 7B using Hugging Face, Weights and Biases, Choline, and Vast AI☆38Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆124Updated 8 months ago
- Micro Llama is a small Llama-based model with 300M parameters, trained from scratch on a $500 budget☆141Updated 9 months ago
- How to export Hugging Face's 🤗 NLP Transformers models to ONNX and use the exported model with the appropriate Transformers pipeline.☆24Updated 2 years ago
- ☆37Updated last year
- JAX implementations of RWKV☆19Updated last year
- Large-scale query-focused multi-document summarization dataset☆10Updated 3 years ago