kennethleungty / Llama-2-Open-Source-LLM-CPU-InferenceLinks
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆963Updated last year
Alternatives and similar repositories for Llama-2-Open-Source-LLM-CPU-Inference
Users that are interested in Llama-2-Open-Source-LLM-CPU-Inference are comparing it to the libraries listed below
Sorting:
- LLaMA v2 Chatbot☆1,409Updated last year
- Open-source tool to visualise your RAG 🔮☆1,146Updated 6 months ago
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…☆1,956Updated last year
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,498Updated last year
- Ship RAG based LLM web apps in seconds.☆995Updated last year
- Run inference on MPT-30B using CPU☆575Updated 2 years ago
- Make Llama2 use Code Execution, Debug, Save Code, Reuse it, Access to Internet☆688Updated last year
- Evaluation tool for LLM QA chains☆1,079Updated 2 years ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,805Updated 11 months ago
- ⚡ Langchain apps in production using Jina & FastAPI☆1,632Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data☆504Updated last year
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.☆793Updated last year
- kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)☆584Updated this week
- ⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops☆930Updated last year
- ☆769Updated 3 weeks ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,868Updated last year
- Decoupling Reasoning from Observations for Efficient Augmented Language Models☆909Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,479Updated 2 months ago
- ☆1,489Updated last year
- ☆933Updated 7 months ago
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,259Updated 4 months ago
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain☆3,476Updated last year
- One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) 🌈🐂 Replace OpenAI GPT…☆752Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,005Updated 6 months ago
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…☆1,459Updated last year
- 🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.☆1,846Updated 2 months ago
- Allows to scale the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and Redis d…☆524Updated last year
- Agent techniques to augment your LLM and push it beyong its limits☆1,579Updated last year
- ☆962Updated 10 months ago
- ☆275Updated 2 years ago