kennethleungty / Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
☆951Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Llama-2-Open-Source-LLM-CPU-Inference
- ⚡ Langchain apps in production using Jina & FastAPI☆1,609Updated last year
- Run inference on MPT-30B using CPU☆572Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,456Updated 5 months ago
- Agent techniques to augment your LLM and push it beyong its limits☆1,541Updated 5 months ago
- Evaluation tool for LLM QA chains☆1,062Updated last year
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…☆1,450Updated last year
- Open-source tool to visualise your RAG 🔮☆1,083Updated 7 months ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,712Updated 3 months ago
- RayLLM - LLMs on Ray☆1,233Updated 5 months ago
- ⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops☆904Updated 3 months ago
- ☆275Updated last year
- The Official Python Client for Lamini's API☆2,519Updated 2 weeks ago
- ☆1,021Updated last year
- LLaMA v2 Chatbot☆1,391Updated last year
- Ship RAG based LLM web apps in seconds.☆974Updated 9 months ago
- ☆1,415Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data☆468Updated last year
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,161Updated 3 weeks ago
- 🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.☆1,704Updated this week
- ☆564Updated last year
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/☆4,002Updated last week
- ☆742Updated 10 months ago
- LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading a…☆1,114Updated 10 months ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,466Updated last year
- Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.☆785Updated last year
- Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks☆595Updated last year
- ☆297Updated 11 months ago
- Visualization and debugging tool for LangChain workflows☆721Updated 8 months ago
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".☆765Updated 7 months ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits☆706Updated 5 months ago