iaalm / llama-api-server
A OpenAI API compatible REST server for llama.
☆199Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for llama-api-server
- An OpenAI-like LLaMA inference API☆111Updated last year
- Visual Studio Code extension for WizardCoder☆144Updated last year
- C++ implementation for 💫StarCoder☆446Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Updated last year
- Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and Kobol…☆212Updated last year
- Harnessing the Memory Power of the Camelids☆145Updated last year
- ☆275Updated last year
- Local LLM ReAct Agent with Guidance☆155Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆114Updated last year
- TheBloke's Dockerfiles☆299Updated 8 months ago
- Provide a way to use the GPT-QLLama model as an API☆43Updated last year
- ☆136Updated 11 months ago
- Run any Large Language Model behind a unified API☆159Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆324Updated last year
- A prompt/context management system☆165Updated last year
- A command-line interface to generate textual and conversational datasets with LLMs.☆293Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆244Updated 9 months ago
- A fast batching API to serve LLM models☆172Updated 6 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year
- Python bindings for the C++ port of GPT4All-J model.☆38Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆89Updated 3 weeks ago
- An Autonomous LLM Agent that runs on Wizcoder-15B☆338Updated last month
- A Personalised AI Assistant Inspired by 'Diamond Age, Powered by SMS☆92Updated last year
- A multimodal, function calling powered LLM webui.☆208Updated last month
- Use local llama LLM or openai to chat, discuss/summarize your documents, youtube videos, and so on.☆152Updated 6 months ago