kennethleungty / Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆967 · Updated last year
Alternatives and similar repositories for Llama-2-Open-Source-LLM-CPU-Inference
Users interested in Llama-2-Open-Source-LLM-CPU-Inference are comparing it to the libraries listed below.
- Ship RAG based LLM web apps in seconds. ☆996 · Updated last year
- LLaMA v2 Chatbot ☆1,413 · Updated 2 years ago
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend… ☆1,956 · Updated last year
- Run inference on MPT-30B using CPU ☆575 · Updated 2 years ago
- Open-source tool to visualise your RAG 🔮 ☆1,169 · Updated 9 months ago
- Evaluation tool for LLM QA chains ☆1,087 · Updated 2 years ago
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,835 · Updated last year
- ⚡ Langchain apps in production using Jina & FastAPI ☆1,633 · Updated 2 years ago
- LLM as a Chatbot Service ☆3,342 · Updated last year
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo… ☆1,502 · Updated 2 years ago
- ☆1,506 · Updated last year
- Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet ☆686 · Updated 2 years ago
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform… ☆1,460 · Updated last year
- Agent techniques to augment your LLM and push it beyond its limits ☆1,584 · Updated last year
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥 ☆1,015 · Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆2,007 · Updated 9 months ago
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI. ☆796 · Updated 2 years ago
- ☆275 · Updated 2 years ago
- The Official Python Client for Lamini's API ☆2,543 · Updated 6 months ago
- Official supported Python bindings for llama.cpp + gpt4all ☆1,015 · Updated 2 years ago
- Fine-Tuning Embedding for RAG with Synthetic Data ☆512 · Updated 2 years ago
- The web framework for building LLM microservices [deprecated] ☆994 · Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ☆1,477 · Updated 5 months ago
- Explore large language models in 512MB of RAM ☆1,194 · Updated 2 months ago
- ☆774 · Updated 3 months ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits ☆729 · Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,876 · Updated last year
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain ☆3,476 · Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆821 · Updated 2 years ago
- ☆964 · Updated last year