This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT) variant. The implementation focuses on the model architecture and the inference process. The code is restructured and heavily commented to facilitate easy understanding of the key parts of the architecture.
☆74Oct 1, 2023Updated 2 years ago
Alternatives and similar repositories for LLaMA2
Users that are interested in LLaMA2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo with code for NIR'24 challange☆14Apr 22, 2024Updated last year
- CVPR 2023: PAniC-3D, Vtubers dataset downloader☆13Apr 22, 2023Updated 2 years ago
- My defense presentation☆10Mar 7, 2022Updated 4 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆19Jan 31, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Sep 11, 2023Updated 2 years ago
- ☆12Jun 27, 2024Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 3 months ago
- docset for Dash containing MSDN content☆16Feb 19, 2017Updated 9 years ago
- Wrapper to easily generate the chat template for Llama2☆65Mar 10, 2024Updated 2 years ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 8 months ago
- LLaMA 2 implemented from scratch in PyTorch☆369Sep 25, 2023Updated 2 years ago
- Change Text Input Source by shortcut for OS X☆19May 9, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for "Automatic Circuit Finding and Faithfulness"☆17Jul 11, 2024Updated last year
- Step by step explanation/tutorial of llama2.c☆229Oct 9, 2023Updated 2 years ago
- An easy way to generate PDF files which could be imported into overleaf with python/matplotlib☆16May 31, 2020Updated 5 years ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆34Oct 16, 2024Updated last year
- FGLA: Fast Generation-Based Gradient Leakage Attacks against Highly Compressed Gradients☆14Mar 17, 2026Updated 3 weeks ago
- ☆19Feb 18, 2025Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 7 months ago
- JavaScript bindings for the ggml-js library☆45Nov 10, 2025Updated 5 months ago
- Code for Personalized Large Language Models via Selective Prompt Tuning☆10Jun 26, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated last year
- Orpheus-TTS local speech synthesizer written entirely in C#☆28Nov 25, 2025Updated 4 months ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated last month
- ☆14Jul 7, 2024Updated last year
- Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ☆15Jul 5, 2025Updated 9 months ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆69Mar 7, 2024Updated 2 years ago
- Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.☆18Mar 23, 2023Updated 3 years ago
- ☆10Aug 13, 2021Updated 4 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization☆170Nov 26, 2025Updated 4 months ago
- [EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…☆15Mar 28, 2025Updated last year
- utilizing GAN and AutoEncoder for denoising☆10Nov 26, 2019Updated 6 years ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- MARNNs Can Learn Generalized Dyck Languages☆12Nov 11, 2019Updated 6 years ago
- ☆14Dec 9, 2020Updated 5 years ago
- A ready to use boilerplate Flask App for Data Scientist, ML engineer...☆16Jan 31, 2023Updated 3 years ago