This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT) variant. The implementation focuses on the model architecture and the inference process. The code is restructured and heavily commented to facilitate easy understanding of the key parts of the architecture.
☆74Oct 1, 2023Updated 2 years ago
Alternatives and similar repositories for LLaMA2
Users that are interested in LLaMA2 are comparing it to the libraries listed below
Sorting:
- Training and Fine-tuning an llm in Python and PyTorch.☆43Aug 30, 2023Updated 2 years ago
- generative models on toys☆12Sep 10, 2024Updated last year
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- Repo with code for NIR'24 challange☆14Apr 22, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- working implimention of deepseek MLA☆45Jan 8, 2025Updated last year
- My defense presentation☆10Mar 7, 2022Updated 4 years ago
- ☆11Feb 3, 2025Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆203Aug 23, 2024Updated last year
- ☆29Dec 15, 2025Updated 3 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 7 months ago
- This Repo Contains Script To Fine Tune Open Source Models Using Unsloth by using UI with simple click and progress☆11Oct 3, 2024Updated last year
- LLaMA 2 implemented from scratch in PyTorch☆367Sep 25, 2023Updated 2 years ago
- Code for data reduction and analysis of Galaxy Zoo 2☆14May 20, 2016Updated 9 years ago
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆29Nov 25, 2025Updated 3 months ago
- The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".☆21Mar 9, 2023Updated 3 years ago
- Step by step explanation/tutorial of llama2.c☆226Oct 9, 2023Updated 2 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 10 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- Free chrome extension to summarize articles on the web using ChatGPT AI☆18Jan 7, 2023Updated 3 years ago
- D.Com 학우들을 위한 커리어 조언 Repo☆13May 17, 2023Updated 2 years ago
- ☆19Feb 18, 2025Updated last year
- JavaScript bindings for the ggml-js library☆45Nov 10, 2025Updated 4 months ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- A repository for easy note tracking and digestion.☆17Oct 6, 2019Updated 6 years ago
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials☆13Jan 9, 2026Updated 2 months ago
- 3D Telecommunications project utilizing Holoportation technology to provide live volumetric capture. Used in one case to increase the re…☆20Feb 20, 2026Updated last month
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated last year
- Orpheus-TTS local speech synthesizer written entirely in C#☆29Nov 25, 2025Updated 3 months ago
- ☆14Jul 7, 2024Updated last year
- Read and write tensorboard data using Rust☆24Feb 4, 2024Updated 2 years ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆69Mar 7, 2024Updated 2 years ago
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.☆18Mar 23, 2023Updated 2 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- A virtual machine implementation of "伟福" COP2000 development board (microinstruction level)☆17Dec 22, 2022Updated 3 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization☆172Nov 26, 2025Updated 3 months ago