meta-llama / llamaLinks
Inference code for Llama models
☆59,002Updated 11 months ago
Alternatives and similar repositories for llama
Users that are interested in llama are comparing it to the libraries listed below
Sorting:
- Inference code for CodeLlama models☆16,372Updated last year
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,268Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,332Updated 6 months ago
- LLM inference in C/C++☆92,005Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆46,055Updated this week
- Instruct-tune LLaMA on consumer hardware☆18,986Updated last year
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆20,347Updated last week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆13,106Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,069Updated last week
- Universal LLM Deployment Engine with ML Compilation☆21,777Updated this week
- The official Meta Llama 3 GitHub site☆29,148Updated 11 months ago
- 🦜🔗 The platform for reliable agents.☆122,644Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,795Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,220Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,121Updated last month
- Large Language Model Text Generation Inference☆10,711Updated last week
- Open-source search and retrieval database for AI applications.☆25,144Updated this week
- The definitive Web UI for local AI, with powerful features and easy setup.☆45,706Updated last week
- Inference Llama 2 in one file of pure C☆19,046Updated last year
- Official inference library for Mistral models☆10,606Updated last month
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf☆24,500Updated 5 months ago
- 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)☆18,970Updated 5 months ago
- Fast and memory-efficient exact attention☆21,317Updated this week
- StableLM: Stability AI Language Models☆15,785Updated last year
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,255Updated last week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆21,915Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆66,313Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆49,952Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆16,870Updated 2 months ago
- Making large AI models cheaper, faster and more accessible☆41,310Updated last week