kennethleungty / Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆969Updated 2 years ago
Alternatives and similar repositories for Llama-2-Open-Source-LLM-CPU-Inference
Users who are interested in Llama-2-Open-Source-LLM-CPU-Inference are comparing it to the libraries listed below.
- Ship RAG based LLM web apps in seconds.☆1,002Updated last year
- Evaluation tool for LLM QA chains☆1,088Updated 2 years ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,506Updated 2 years ago
- Run inference on MPT-30B using CPU☆576Updated 2 years ago
- LLaMA v2 Chatbot☆1,413Updated 2 years ago
- Open-source tool to visualise your RAG 🔮☆1,199Updated 10 months ago
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…☆1,957Updated last year
- ☆778Updated 5 months ago
- ⚡ Langchain apps in production using Jina & FastAPI☆1,635Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,479Updated 7 months ago
- The web framework for building LLM microservices [deprecated]☆994Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,008Updated 11 months ago
- Agent techniques to augment your LLM and push it beyond its limits☆1,585Updated last year
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥☆1,018Updated last year
- A comprehensive guide to building RAG-based LLM applications for production.☆1,841Updated last year
- 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, …☆2,450Updated 2 weeks ago
- ☆1,048Updated 2 years ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆824Updated 2 years ago
- Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet☆688Updated 2 years ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆518Updated 2 years ago
- Decoupling Reasoning from Observations for Efficient Augmented Language Models☆927Updated 2 years ago
- One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) 🌈🐂 Replace OpenAI GPT…☆754Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,877Updated last year
- kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)☆594Updated 2 weeks ago
- ⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops☆935Updated last year
- Scale LLM Engine public repository☆814Updated this week
- Chat and Ask on your own data. Accelerator to quickly upload your own enterprise data and use OpenAI services to chat to that uploaded d…☆869Updated 10 months ago
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.☆798Updated 2 years ago
- LLM(😽)☆1,692Updated 9 months ago
- ☆276Updated 2 years ago