liltom-eth / llama2-webui
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.
☆1,959 · Updated last year
Alternatives and similar repositories for llama2-webui:
Users who are interested in llama2-webui are comparing it to the libraries listed below.
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆2,863 · Updated last year
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. ☆4,818 · Updated 2 weeks ago
- Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A ☆960 · Updated last year
- The Official Python Client for Lamini's API ☆2,530 · Updated 2 weeks ago
- 4 bits quantization of LLaMA using GPTQ ☆3,052 · Updated 9 months ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model ☆1,509 · Updated last month
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,631 · Updated last year
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ☆9,381 · Updated 8 months ago
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆1,990 · Updated 3 months ago
- LLaMA v2 Chatbot ☆1,406 · Updated last year
- An Open-source Toolkit for LLM Development ☆2,774 · Updated 3 months ago
- LLaMA: Open and Efficient Foundation Language Models ☆2,802 · Updated last year
- LLM as a Chatbot Service ☆3,316 · Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,131 · Updated this week
- A colab gradio web UI for running Large Language Models ☆2,100 · Updated last year
- Run inference on MPT-30B using CPU ☆575 · Updated last year
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,863 · Updated last year
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset ☆7,476 · Updated last year
- ☆1,468 · Updated last year
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/ ☆4,278 · Updated 2 weeks ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,859 · Updated last year
- Instruction Tuning with GPT-4 ☆4,301 · Updated last year
- A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI… ☆598 · Updated last year
- ⚡ Langchain apps in production using Jina & FastAPI ☆1,631 · Updated last year
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain ☆3,477 · Updated last year
- Open Multilingual Chatbot for Everyone ☆1,257 · Updated 11 months ago
- Python package for easily interfacing with chat apps, with robust features and minimal code complexity. ☆3,511 · Updated 9 months ago
- Large Language Model Text Generation Inference ☆10,052 · Updated this week
- Let ChatGPT teach your own chatbot in hours with a single GPU! ☆3,170 · Updated last year