iaalm / llama-api-serverLinks

A OpenAI API compatible REST server for llama.

☆208

Alternatives and similar repositories for llama-api-server

Users that are interested in llama-api-server are comparing it to the libraries listed below

Sorting:

c0sogi / llama-api
An OpenAI-like LLaMA inference API
☆113Updated 2 years ago
TheBlokeAI / dockerLLM
TheBloke's Dockerfiles
☆306Updated last year
lhenault / simpleAI
An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients.
☆332Updated last year
paolorechia / learn-langchain
☆275Updated 2 years ago
ChuloAI / BrainChulo
Harnessing the Memory Power of the Camelids
☆147Updated 2 years ago
Nuggt-dev / Nuggt
An Autonomous LLM Agent that runs on Wizcoder-15B
☆333Updated last year
LucienShui / huggingface-vscode-endpoint-server
starcoder server for huggingface-vscdoe custom endpoint
☆175Updated last year
mzbac / wizardCoder-vsc
Visual Studio Code extension for WizardCoder
☆148Updated 2 years ago
QuangBK / localLLM_guidance
Local LLM ReAct Agent with Guidance
☆158Updated 2 years ago
bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆455Updated 2 years ago
ausboss / Local-LLM-Langchain
Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and Kobol…
☆212Updated 2 years ago
petals-infra / chat.petals.dev
💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
☆315Updated last year
idosal / AgentLLM
AgentLLM is a PoC for browser-native autonomous agents
☆444Updated 2 years ago
nuance1979 / llama-server
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
☆130Updated 2 years ago
atisharma / llama_farm
Use local llama LLM or openai to chat, discuss/summarize your documents, youtube videos, and so on.
☆153Updated 10 months ago
radi-cho / datasetGPT
A command-line interface to generate textual and conversational datasets with LLMs.
☆298Updated 2 years ago
imoneoi / openchat-ui
An open source UI for OpenChat models
☆287Updated last year
mzbac / AutoGPTQ-API
Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.
☆88Updated 2 years ago
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆247Updated last year
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆323Updated 2 years ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆188Updated last year
mzbac / GPTQ-for-LLaMa-API
Provide a way to use the GPT-QLLama model as an API
☆43Updated 2 years ago
shroominic / codebox-api
👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services.
☆353Updated 8 months ago
kaiokendev / superbig
A prompt/context management system
☆170Updated 2 years ago
1b5d / llm-api
Run any Large Language Model behind a unified API
☆170Updated last year
mzbac / qlora-fine-tune
☆166Updated 2 years ago
keldenl / gpt-llama.cpp
A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…
☆598Updated 2 years ago
rhohndorf / Auto-Llama-cpp
Uses Auto-GPT with Llama.cpp
☆387Updated last year
tensorchord / modelz-llm
OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆275Updated 2 years ago
BillSchumacher / Auto-Vicuna
☆137Updated 2 years ago