c0sogi / llama-apiLinks

An OpenAI-like LLaMA inference API

☆112

Alternatives and similar repositories for llama-api

Users that are interested in llama-api are comparing it to the libraries listed below

Sorting:

ChuloAI / BrainChulo
Harnessing the Memory Power of the Camelids
☆146Updated last year
mzbac / wizardCoder-vsc
Visual Studio Code extension for WizardCoder
☆149Updated 2 years ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆185Updated last year
iaalm / llama-api-server
A OpenAI API compatible REST server for llama.
☆208Updated 5 months ago
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
QuangBK / localLLM_guidance
Local LLM ReAct Agent with Guidance
☆158Updated 2 years ago
kaiokendev / superbig
A prompt/context management system
☆170Updated 2 years ago
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆110Updated 2 years ago
mzbac / AutoGPTQ-API
Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.
☆91Updated 2 years ago
atisharma / llama_farm
Use local llama LLM or openai to chat, discuss/summarize your documents, youtube videos, and so on.
☆152Updated 7 months ago
ausboss / Local-LLM-Langchain
Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and Kobol…
☆214Updated 2 years ago
TheBlokeAI / dockerLLM
TheBloke's Dockerfiles
☆305Updated last year
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆215Updated 10 months ago
OoriData / OgbujiPT
Client-side toolkit for using large language models, including where self-hosted
☆112Updated 8 months ago
nuance1979 / llama-server
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
☆128Updated 2 years ago
Itachi-Uchiha581 / Auto-Data
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
☆102Updated 9 months ago
paolorechia / learn-langchain
☆275Updated 2 years ago
Maximilian-Winter / AIRoleplay
Little AI roleplay program
☆59Updated last year
Dhaladom / TALIS
Simple and fast server for GPTQ-quantized LLaMA inference
☆24Updated 2 years ago
ChobPT / oobaboogas-webui-langchain_agent
Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work
☆74Updated last year
PygmalionAI / training-code
The code we currently use to fine-tune models.
☆114Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222Updated last year
the-crypt-keeper / LLooM
Experimental LLM Inference UX to aid in creative writing
☆119Updated 7 months ago
sebaxzero / LangChain_PDFChat_Oobabooga
oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local
☆71Updated 2 years ago
Nuggt-dev / Nuggt
An Autonomous LLM Agent that runs on Wizcoder-15B
☆334Updated 9 months ago
nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆35Updated last year
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆246Updated last year
mzbac / GPTQ-for-LLaMa-API
Provide a way to use the GPT-QLLama model as an API
☆43Updated 2 years ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year