User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.
★337 · Mar 24, 2023 · Updated 3 years ago
Alternatives and similar repositories for llama
Users that are interested in llama are comparing it to the libraries listed below.
- ChatLLaMA: Open source implementation for LLaMA-based ChatGPT, runnable on a single GPU. 15x faster training process than ChatGPT. ★1,201 · Jan 18, 2025 · Updated last year
- Chat with Meta's LLaMA models at home made easy ★842 · Apr 2, 2023 · Updated 3 years ago
- Tools for formatting large language model prompts. ★13 · Dec 19, 2023 · Updated 2 years ago
- LLaMA: Open and Efficient Foundation Language Models ★2,785 · Nov 8, 2023 · Updated 2 years ago
- ★16 · Jan 8, 2022 · Updated 4 years ago
- Quantized inference code for LLaMA models ★1,040 · Mar 17, 2023 · Updated 3 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ★15 · May 3, 2023 · Updated 3 years ago
- Official repository for Fourier model that can generate periodic signals ★10 · Mar 10, 2022 · Updated 4 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ. ★22 · Aug 20, 2024 · Updated last year
- ChatGLM-Peft-Tuning ★13 · Mar 19, 2023 · Updated 3 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning ★12 · Aug 23, 2025 · Updated 8 months ago
- Quantized inference code for LLaMA models ★13 · Mar 12, 2023 · Updated 3 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ★6,084 · Jul 1, 2025 · Updated 10 months ago
- ★10 · Jul 24, 2023 · Updated 2 years ago
- An easy-to-use implementation of Barlow Twins for PyTorch. ★16 · May 16, 2021 · Updated 4 years ago
- Instruct-tune LLaMA on consumer hardware ★18,937 · Jul 29, 2024 · Updated last year
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ★5,928 · Mar 14, 2024 · Updated 2 years ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model ★575 · Jan 28, 2025 · Updated last year
- Simple UI for LLM Model Finetuning ★2,057 · Dec 21, 2023 · Updated 2 years ago
- ★74 · Sep 5, 2023 · Updated 2 years ago
- ★456 · Oct 15, 2023 · Updated 2 years ago
- A general purpose web app for connecting participants to engage in realtime conversations based on generated prompts. ★20 · Jun 21, 2023 · Updated 2 years ago
- A collection of prompts for Llama ★101 · Mar 23, 2023 · Updated 3 years ago
- An open collection of methodologies to help with successful training of large language models. ★559 · Feb 15, 2024 · Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ★37 · Sep 19, 2022 · Updated 3 years ago
- 4 bits quantization of LLaMA using GPTQ ★3,072 · Jul 13, 2024 · Updated last year
- μ΄λ κ³ λ±νμμ μ¬νν νλ₯ λ‘ μ μ΅λ¬΄μ λ§λ€κΈ°β19Sep 2, 2023Updated 2 years ago
- Multi-language Enhanced LLaMA ★302 · Apr 13, 2023 · Updated 3 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches ★20 · Mar 31, 2023 · Updated 3 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data. ★30,258 · Jul 17, 2024 · Updated last year
- Simple llama usage example ★47 · Mar 10, 2023 · Updated 3 years ago
- An extension of the Transformers library to include a T5ForSequenceClassification class. ★40 · Apr 17, 2023 · Updated 3 years ago
- ★14 · Dec 9, 2021 · Updated 4 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ★2,519 · Aug 13, 2024 · Updated last year
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" ★13 · Dec 1, 2024 · Updated last year
- These are excerpts from various blog posts related to AI & ML that deal with important research and business cases. ★21 · Feb 17, 2023 · Updated 3 years ago
- High-speed download of LLaMA, Facebook's 65B parameter GPT model ★4,131 · Jun 28, 2023 · Updated 2 years ago
- ★28 · Oct 18, 2022 · Updated 3 years ago
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary. ★26 · Dec 1, 2022 · Updated 3 years ago