An OpenAI-like LLaMA inference API
☆113 · Sep 17, 2023 · Updated 2 years ago
Alternatives and similar repositories for llama-api
Users that are interested in llama-api are comparing it to the libraries listed below.
- LLM Chat is an open-source serverless alternative to ChatGPT. ☆36 · Sep 13, 2024 · Updated last year
- An OpenAI API compatible images server to generate or manipulate images. ☆18 · Feb 2, 2025 · Updated last year
- vits2 backbone with multilingual-bert (Korean support) ☆27 · Apr 6, 2024 · Updated 2 years ago
- oobabooga extension - Experimental sampler to make LLMs more creative ☆23 · Aug 2, 2023 · Updated 2 years ago
- A Python package designed to simplify the process of creating and managing function calls to OpenAI's API, as well as models using LiteLL… ☆17 · May 25, 2025 · Updated 11 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆2,921 · Sep 30, 2023 · Updated 2 years ago
- ☆136 · May 3, 2026 · Updated 2 weeks ago
- My Gen AI research ☆11 · Jun 3, 2024 · Updated last year
- Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Tra… ☆1,286 · Jan 24, 2024 · Updated 2 years ago
- Gugugo: Korean open-source translation model project ☆84 · Apr 7, 2024 · Updated 2 years ago
- LLM backed Fantasy Tribe Game ☆19 · Nov 21, 2024 · Updated last year
- ☆16 · Dec 16, 2024 · Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text ☆98 · Apr 2, 2026 · Updated last month
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc… ☆17 · Dec 8, 2023 · Updated 2 years ago
- Wheels for llama-cpp-python compiled with cuBLAS support ☆103 · Feb 1, 2024 · Updated 2 years ago
- An omnipowerful personal assistant powered by LLMs, Zapier NLA, and custom actions. ☆15 · Sep 13, 2024 · Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆266 · Mar 6, 2025 · Updated last year
- A high performance batching router optimises max throughput for text inference workload ☆16 · Sep 6, 2023 · Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script ☆10 · Jan 20, 2026 · Updated 4 months ago
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API ☆24 · Aug 1, 2024 · Updated last year
- Oobabooga extension for Bark TTS ☆119 · Nov 23, 2023 · Updated 2 years ago
- An extension to oobabooga's TextGen allowing you to receive pics generated by Automatic1111's SD API ☆12 · May 16, 2023 · Updated 3 years ago
- The one who calls upon functions - Function-Calling Language Model ☆36 · Oct 2, 2023 · Updated 2 years ago
- Train LLMs by just modifying config files! ☆24 · Nov 23, 2023 · Updated 2 years ago
- ☆40 · Aug 26, 2025 · Updated 8 months ago
- Korean-OpenOrca: llama2 fine-tuned on the OpenOrca-KO dataset ☆18 · Nov 1, 2023 · Updated 2 years ago
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing. ☆17 · Apr 8, 2026 · Updated last month
- XTTSv2 Extension for oobabooga text-generation-webui ☆157 · Nov 21, 2023 · Updated 2 years ago
- jQuery, React and Streamlit applications written by LLMs ☆15 · Dec 24, 2023 · Updated 2 years ago
- Easily create LLM automation/agent workflows ☆60 · Feb 13, 2024 · Updated 2 years ago
- This package provides Swift bindings for llama.cpp ☆26 · Apr 4, 2023 · Updated 3 years ago
- A frontend for creative writing with LLMs ☆160 · Jul 15, 2024 · Updated last year
- CodeUp: A Multilingual Code Generation Llama-X Model with Parameter-Efficient Instruction-Tuning ☆127 · Dec 25, 2024 · Updated last year
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu… ☆26 · Aug 16, 2024 · Updated last year
- Web UI for ExLlamaV2 ☆511 · Feb 5, 2025 · Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆368 · Dec 9, 2023 · Updated 2 years ago
- Convert a ChatGPT export into an Open-WebUI importable JSON. ☆37 · Aug 11, 2025 · Updated 9 months ago
- The official API server for Exllama. OAI compatible, lightweight, and fast. ☆1,219 · May 14, 2026 · Updated last week
- Core Engine of Singing Voice Conversion & Singing Voice Clone ☆17 · Jul 15, 2023 · Updated 2 years ago