This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pre-trained Transformer (GPT) variant. The implementation focuses on the model architecture and the inference process. The code is restructured and heavily commented to make the key parts of the architecture easy to understand.
☆75 · Oct 1, 2023 · Updated 2 years ago
Alternatives and similar repositories for LLaMA2
Users who are interested in LLaMA2 are comparing it to the libraries listed below.
- Training and fine-tuning an LLM in Python and PyTorch. ☆43 · Aug 30, 2023 · Updated 2 years ago
- Inference Llama 2 in one file of pure Haskell (A port of llama2.c from Andrej Karpathy) ☆14 · Oct 17, 2025 · Updated 7 months ago
- Repo with code for NIR'24 challenge ☆14 · Apr 22, 2024 · Updated 2 years ago
- ☆11 · Oct 11, 2023 · Updated 2 years ago
- PyTorch Quantization Framework For OCP MX Datatypes. ☆16 · May 30, 2025 · Updated 11 months ago
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist ☆16 · Aug 21, 2025 · Updated 9 months ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader ☆13 · Apr 22, 2023 · Updated 3 years ago
- Working implementation of DeepSeek MLA ☆44 · Jan 8, 2025 · Updated last year
- My defense presentation ☆10 · Mar 7, 2022 · Updated 4 years ago
- Scaling Sparse Fine-Tuning to Large Language Models ☆19 · Jan 31, 2024 · Updated 2 years ago
- ☆12 · Jun 27, 2024 · Updated last year
- Ultra-minimal autoregressive diffusion model for image generation ☆21 · Dec 26, 2025 · Updated 5 months ago
- Direct UI engine for Windows platform ☆10 · Dec 10, 2015 · Updated 10 years ago
- Docset for Dash containing MSDN content ☆16 · Feb 19, 2017 · Updated 9 years ago
- Wrapper to easily generate the chat template for Llama2 ☆65 · Mar 10, 2024 · Updated 2 years ago
- ☆29 · Dec 15, 2025 · Updated 5 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-Source LLMs, to Production with vLLM 🎹 ☆23 · Aug 2, 2025 · Updated 9 months ago
- LLaMA 2 implemented from scratch in PyTorch ☆368 · Sep 25, 2023 · Updated 2 years ago
- PyTorch implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA… ☆24 · Nov 25, 2025 · Updated 6 months ago
- This repo contains a script to fine-tune open-source models using Unsloth via a UI with simple clicks and progress ☆12 · Oct 3, 2024 · Updated last year
- Code for data reduction and analysis of Galaxy Zoo 2 ☆14 · May 20, 2016 · Updated 10 years ago
- Code for "Automatic Circuit Finding and Faithfulness" ☆17 · Jul 11, 2024 · Updated last year
- ☆15 · Jun 26, 2024 · Updated last year
- Step by step explanation/tutorial of llama2.c ☆233 · Oct 9, 2023 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆11 · Sep 4, 2025 · Updated 8 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba ☆38 · Oct 16, 2025 · Updated 7 months ago
- An easy way to generate PDF files which could be imported into overleaf with python/matplotlib ☆16 · May 31, 2020 · Updated 5 years ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024) ☆34 · Oct 16, 2024 · Updated last year
- ☆19 · Feb 18, 2025 · Updated last year
- ☆11 · Jan 24, 2025 · Updated last year
- JavaScript bindings for the ggml-js library ☆44 · Nov 10, 2025 · Updated 6 months ago
- Kanban board made with TailwindCSS ☆11 · Jun 10, 2021 · Updated 4 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning ☆10 · Jun 26, 2024 · Updated last year
- Small, simple agent task environments for training and evaluation ☆19 · Nov 1, 2024 · Updated last year
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning. ☆15 · Jan 14, 2026 · Updated 4 months ago
- Orpheus-TTS local speech synthesizer written entirely in C# ☆30 · Nov 25, 2025 · Updated 6 months ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script ☆14 · Mar 6, 2026 · Updated 2 months ago
- A repository of the latest work related to underwater image enhancement (awaiting continuous updates). It provides relevant underwater im… ☆22 · May 14, 2026 · Updated 2 weeks ago
- ☆14 · Jul 7, 2024 · Updated last year