kennethleungty / Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆960 · Updated last year
Alternatives and similar repositories for Llama-2-Open-Source-LLM-CPU-Inference:
Users who are interested in Llama-2-Open-Source-LLM-CPU-Inference are comparing it to the libraries listed below.
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend… ☆1,962 · Updated 11 months ago
- LLaMA v2 Chatbot ☆1,403 · Updated last year
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform… ☆1,450 · Updated last year
- Evaluation tool for LLM QA chains ☆1,070 · Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆819 · Updated last year
- ☆1,028 · Updated last year
- ⚡ Langchain apps in production using Jina & FastAPI ☆1,618 · Updated last year
- Open-source tool to visualise your RAG 🔮 ☆1,111 · Updated 2 months ago
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain ☆3,470 · Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆1,982 · Updated 2 months ago
- The Official Python Client for Lamini's API ☆2,524 · Updated this week
- ☆757 · Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,843 · Updated last year
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,772 · Updated 7 months ago
- LLM(😽) ☆1,660 · Updated last month
- ☆1,451 · Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆689 · Updated 10 months ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,626 · Updated last year
- ☆1,024 · Updated last year
- RayLLM - LLMs on Ray ☆1,260 · Updated 9 months ago
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning ☆300 · Updated 4 months ago
- Explore large language models in 512MB of RAM ☆1,184 · Updated this week
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory". ☆784 · Updated 11 months ago
- LLM as a Chatbot Service ☆3,305 · Updated last year
- Ship RAG based LLM web apps in seconds. ☆986 · Updated last year
- Python package for easily interfacing with chat apps, with robust features and minimal code complexity. ☆3,502 · Updated 8 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. ☆42 · Updated last year
- ⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops ☆916 · Updated 7 months ago
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. ☆4,476 · Updated this week
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings ☆1,918 · Updated last month