kennethleungty / Llama-2-Open-Source-LLM-CPU-InferenceLinks
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆962Updated last year
Alternatives and similar repositories for Llama-2-Open-Source-LLM-CPU-Inference
Users that are interested in Llama-2-Open-Source-LLM-CPU-Inference are comparing it to the libraries listed below
Sorting:
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…☆1,957Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,477Updated last month
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,866Updated last year
- Evaluation tool for LLM QA chains☆1,070Updated 2 years ago
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.☆791Updated last year
- Run inference on MPT-30B using CPU☆575Updated last year
- Explore large language models in 512MB of RAM☆1,192Updated 3 months ago
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆307Updated 7 months ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,493Updated last year
- ☆1,482Updated last year
- Agent techniques to augment your LLM and push it beyong its limits☆1,578Updated last year
- C++ implementation for 💫StarCoder☆452Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,878Updated last year
- Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet☆686Updated last year
- ⚡ Langchain apps in production using Jina & FastAPI☆1,632Updated last year
- A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…☆598Updated last year
- Finetuning Large Language Models on One Consumer GPU in 2 Bits☆721Updated last year
- Ship RAG based LLM web apps in seconds.☆992Updated last year
- ☆453Updated last year
- Inference Llama 2 in one file of pure 🔥☆2,111Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions☆1,993Updated 5 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆500Updated last year
- LLaMA v2 Chatbot☆1,411Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆587Updated last year
- Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sour…☆2,652Updated 8 months ago
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain☆3,477Updated last year
- Customizable implementation of the self-instruct paper.☆1,043Updated last year
- ☆1,032Updated 2 years ago
- ☆766Updated last year
- Tune any FALCON in 4-bit☆467Updated last year