Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆974Nov 6, 2023Updated 2 years ago
Alternatives and similar repositories for Llama-2-Open-Source-LLM-CPU-Inference
Users that are interested in Llama-2-Open-Source-LLM-CPU-Inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…☆1,946Mar 22, 2024Updated 2 years ago
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…☆1,464Nov 7, 2023Updated 2 years ago
- Run inference on MPT-30B using CPU☆576Jun 30, 2023Updated 2 years ago
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,013Dec 29, 2024Updated last year
- 开源社区第一个能下载、能运行的中文 LLaMA2 模型!☆2,220Oct 26, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"☆1,214Oct 22, 2023Updated 2 years ago
- Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.☆22,210Mar 10, 2026Updated 2 weeks ago
- 🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation☆7,172Feb 10, 2025Updated last year
- An Open-source Toolkit for LLM Development☆2,806Jan 13, 2025Updated last year
- Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. D…☆12,004Oct 9, 2025Updated 5 months ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,479May 1, 2025Updated 10 months ago
- Python package for easily interfacing with chat apps, with robust features and minimal code complexity.☆3,512Jul 3, 2024Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,478Jun 7, 2025Updated 9 months ago
- LLaMA v2 Chatbot☆1,415Aug 27, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Inference Llama 2 in one file of pure C☆19,302Aug 6, 2024Updated last year
- Large Language Model Text Generation Inference☆10,812Jan 8, 2026Updated 2 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,774Mar 11, 2026Updated 2 weeks ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,506Sep 11, 2023Updated 2 years ago
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.☆5,559May 21, 2025Updated 10 months ago
- TypeChat is a library that makes it easy to build natural language interfaces using types.☆8,634Mar 10, 2026Updated 2 weeks ago
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app …☆6,484Mar 11, 2026Updated 2 weeks ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,261Mar 3, 2026Updated 3 weeks ago
- The no-code platform for building custom LLM Agents☆2,941Jun 17, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 👾 Open source implementation of the ChatGPT Code Interpreter☆3,859Nov 7, 2024Updated last year
- Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR …☆1,707Feb 3, 2025Updated last year
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,887Sep 26, 2024Updated last year
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,475Sep 13, 2024Updated last year
- Universal LLM Deployment Engine with ML Compilation☆22,246Mar 18, 2026Updated last week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆10,020Sep 7, 2024Updated last year
- ☆2,559Jan 7, 2025Updated last year
- Open Source AI Platform - AI Chat with advanced features that works with every LLM☆17,988Updated this week
- Explore large language models in 512MB of RAM☆1,197Feb 19, 2026Updated last month
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆1,061May 29, 2023Updated 2 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,922May 3, 2024Updated last year
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,618Dec 11, 2024Updated last year
- kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)☆599Mar 17, 2026Updated last week
- 🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation w…☆6,208Jan 20, 2026Updated 2 months ago
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥☆1,017Dec 10, 2023Updated 2 years ago
- AI companions with memory: a lightweight stack to create and host your own AI companions☆5,943Apr 23, 2024Updated last year