kennethleungty / Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆967Updated 2 years ago
Alternatives and similar repositories for Llama-2-Open-Source-LLM-CPU-Inference
Users who are interested in Llama-2-Open-Source-LLM-CPU-Inference are comparing it to the repositories listed below
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…☆1,954Updated last year
- Ship RAG based LLM web apps in seconds.☆998Updated last year
- Run inference on MPT-30B using CPU☆575Updated 2 years ago
- Open-source tool to visualise your RAG 🔮☆1,174Updated 10 months ago
- LLaMA v2 Chatbot☆1,411Updated 2 years ago
- ⚡ Langchain apps in production using Jina & FastAPI☆1,632Updated 2 years ago
- Evaluation tool for LLM QA chains☆1,086Updated 2 years ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,875Updated last year
- The web framework for building LLM microservices [deprecated]☆993Updated last year
- ☆1,508Updated 2 years ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,502Updated 2 years ago
- Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet☆686Updated 2 years ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,838Updated last year
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.☆797Updated 2 years ago
- Implementation of plug-and-play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"☆712Updated last year
- ⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops☆933Updated last year
- ☆275Updated 2 years ago
- Chat with your documents offline using AI.☆735Updated 2 years ago
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,007Updated 10 months ago
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,262Updated 7 months ago
- This repository provides very basic flask, streamlit, and docker examples for the llama_index (fka gpt_index) package☆631Updated last year
- ☆774Updated 4 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆514Updated 2 years ago
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥☆1,017Updated last year
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…☆1,462Updated 2 years ago
- Scale LLM Engine public repository☆814Updated last week
- ☆966Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,630Updated 2 years ago
- Agent techniques to augment your LLM and push it beyond its limits☆1,585Updated last year
- UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT…☆475Updated 2 years ago