This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT) variant. The implementation focuses on the model architecture and the inference process. The code is restructured and heavily commented to facilitate easy understanding of the key parts of the architecture.
☆74Oct 1, 2023Updated 2 years ago
Alternatives and similar repositories for LLaMA2
Users that are interested in LLaMA2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Training and Fine-tuning an llm in Python and PyTorch.☆43Aug 30, 2023Updated 2 years ago
- Inference Llama 2 in one file of pure Haskell (A port of llama2.c from Andrej Karpathy)☆14Oct 17, 2025Updated 6 months ago
- PyTorch Quantization Framework For OCP MX Datatypes.☆16May 30, 2025Updated 11 months ago
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist☆16Aug 21, 2025Updated 8 months ago
- My defense presentation☆10Mar 7, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- ☆11Feb 3, 2025Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆19Jan 31, 2024Updated 2 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 4 months ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆21Jun 29, 2024Updated last year
- docset for Dash containing MSDN content☆16Feb 19, 2017Updated 9 years ago
- Wrapper to easily generate the chat template for Llama2☆65Mar 10, 2024Updated 2 years ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆206Aug 23, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 9 months ago
- ☆29Dec 15, 2025Updated 4 months ago
- LLaMA 2 implemented from scratch in PyTorch☆369Sep 25, 2023Updated 2 years ago
- This Repo Contains Script To Fine Tune Open Source Models Using Unsloth by using UI with simple click and progress☆12Oct 3, 2024Updated last year
- Change Text Input Source by shortcut for OS X☆19May 9, 2022Updated 4 years ago
- Code for data reduction and analysis of Galaxy Zoo 2☆14May 20, 2016Updated 9 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated last year
- ☆15Jun 26, 2024Updated last year
- The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".☆21Mar 9, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Sep 4, 2025Updated 8 months ago
- This repository contain the simple llama3 implementation in pure jax.☆72Feb 17, 2025Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- Free chrome extension to summarize articles on the web using ChatGPT AI☆18Jan 7, 2023Updated 3 years ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆34Oct 16, 2024Updated last year
- ☆19Feb 18, 2025Updated last year
- Repository for the research work "Ontology Generation using Large Language Models", presented at ESWC 2025.☆35Aug 15, 2025Updated 8 months ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- Small, simple agent task environments for training and evaluation☆19Nov 1, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated last year
- Orpheus-TTS local speech synthesizer written entirely in C#☆30Nov 25, 2025Updated 5 months ago
- A machine learning solution for extracting key entity values (weight, volume, dimensions) from product images.☆18Sep 17, 2024Updated last year
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Apr 25, 2024Updated 2 years ago
- 🥪 Mess portal where owners can set their weekly menu, price, time, and students can purchase their desired coupons, with a QR code syste…☆11Jun 2, 2023Updated 2 years ago
- ☆14Jul 7, 2024Updated last year
- Gaussian Embedding of Large-scale Attributed Graphs☆10Mar 13, 2020Updated 6 years ago