kennethleungty / Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆950Updated last year
Related projects
Alternatives and complementary repositories for Llama-2-Open-Source-LLM-CPU-Inference
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,814Updated 9 months ago
- Run inference on MPT-30B using CPU☆573Updated last year
- ⚡ Langchain apps in production using Jina & FastAPI☆1,613Updated last year
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,468Updated last year
- 🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.☆1,714Updated 2 weeks ago
- LLaMA v2 Chatbot☆1,395Updated last year
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…☆1,964Updated 7 months ago
- Agent techniques to augment your LLM and push it beyond its limits☆1,545Updated 5 months ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,714Updated 3 months ago
- Customizable implementation of the self-instruct paper.☆1,024Updated 8 months ago
- Simple UI for LLM Model Finetuning☆2,045Updated 10 months ago
- A school for camelids☆1,208Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,458Updated 6 months ago
- A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.☆1,379Updated 2 months ago
- The web framework for building LLM microservices☆976Updated 4 months ago
- ☆744Updated 10 months ago
- ☆1,421Updated last year
- Ship RAG based LLM web apps in seconds.☆976Updated 9 months ago
- AutoChain: Build lightweight, extensible, and testable LLM Agents☆1,792Updated 5 months ago
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.☆785Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆812Updated last year
- ⛓️ Serving LangChain LLM apps and agents automagically with FastAPI. LLMops☆906Updated 4 months ago
- Visualization and debugging tool for LangChain workflows☆723Updated 8 months ago
- C++ implementation for 💫StarCoder☆446Updated last year
- ☆1,022Updated 10 months ago
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain☆3,455Updated 8 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆941Updated last month
- ☆782Updated 2 months ago
- LLM as a Chatbot Service☆3,296Updated last year
- Evaluation tool for LLM QA chains☆1,063Updated last year