clabrugere / scratch-llm
Implements an LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.
☆38Updated 11 months ago
Alternatives and similar repositories for scratch-llm
Users who are interested in scratch-llm are comparing it to the libraries listed below
- Sample code base for Gemma2 (9B) and Llama3-8B fine-tuning and RAG, implemented on the Kaggle platform☆22Updated 11 months ago
- Manages vllm-nccl dependency☆17Updated last year
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆38Updated 9 months ago
- Demo of Role-Based Access Control in LLM Vector Databases☆16Updated 2 years ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆50Updated 3 weeks ago
- Fast and memory-efficient exact attention ported to rocm☆13Updated 2 years ago
- ☆17Updated last year
- Multi-Layer Key-Value sharing experiments on Pythia models☆34Updated last year
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated 2 years ago
- several types of attention modules written in PyTorch for learning purposes☆52Updated 3 weeks ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated 2 years ago
- a curated list of the role of small models in the LLM era☆111Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆58Updated last week
- Creating the DeepSeek V3 model from scratch☆24Updated 10 months ago
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge☆60Updated last year
- Make triton easier☆50Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆66Updated last year
- Open Implementations of LLM Analyses☆107Updated last year
- Create a source of truth for ML model results and browse it on Papers with Code☆34Updated 4 years ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆64Updated this week
- Utilities for Training Very Large Models☆58Updated last year
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Updated 2 years ago
- PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References☆20Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆22Updated 2 years ago
- ☆15Updated last year
- ML/DL Math and Method notes☆66Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago