An OpenAI-like LLaMA inference API
☆113Sep 17, 2023Updated 2 years ago
Alternatives and similar repositories for llama-api
Users that are interested in llama-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM Chat is an open-source serverless alternative to ChatGPT.☆36Sep 13, 2024Updated last year
- ☆19Jul 23, 2023Updated 2 years ago
- StrategyQA 데이터 세트 번역☆22Apr 12, 2024Updated 2 years ago
- The accompany backend for PAI app☆12Mar 24, 2025Updated last year
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- vits2 backbone with multilingual-bert(한국어 지원)☆27Apr 6, 2024Updated 2 years ago
- oobabooga extension - Experimental sampler to make LLMs more creative☆23Aug 2, 2023Updated 2 years ago
- A Python package designed to simplify the process of creating and managing function calls to OpenAI's API, as well as models using LiteLL…☆17May 25, 2025Updated 11 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,915Sep 30, 2023Updated 2 years ago
- ☆135Apr 8, 2026Updated 3 weeks ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- An interface for llama.cpp, ChatGPT, Gemini, and Claude☆27Apr 16, 2026Updated 2 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆95Apr 2, 2026Updated 3 weeks ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Dec 8, 2023Updated 2 years ago
- Wheels for llama-cpp-python compiled with cuBLAS support☆103Feb 1, 2024Updated 2 years ago
- An omnipowerful personal assistant powered by LLMs, Zapier NLA, and custom actions.☆15Sep 13, 2024Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆268Mar 6, 2025Updated last year
- A OpenAI API compatible REST server for llama.☆208Feb 24, 2025Updated last year
- VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference)☆36Feb 28, 2024Updated 2 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple extension that allows LLM to speak in any voice, literally, based on Sliero TTS which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API☆24Aug 1, 2024Updated last year
- ☆12Apr 28, 2023Updated 3 years ago
- Oobabooga extension for Bark TTS☆119Nov 23, 2023Updated 2 years ago
- An extension to oobabooga's TextGen allowing you to receive pics generated by Automatic1111's SD API☆12May 16, 2023Updated 2 years ago
- NREPL for Hy☆11Aug 8, 2025Updated 8 months ago
- Get more done with LLMs☆13Jan 19, 2024Updated 2 years ago
- Train LLMs by just modifying config files!☆24Nov 23, 2023Updated 2 years ago
- OpenOrca-KO dataset을 활용하여 llama2를 fine-tuning한 Korean-OpenOrca☆18Nov 1, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 디시인사이드 로그라이크 갤러리 카타클리즘 모드팩☆14Apr 10, 2026Updated 2 weeks ago
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.☆17Apr 8, 2026Updated 3 weeks ago
- XTTSv2 Extension for oobabooga text-generation-webui☆157Nov 21, 2023Updated 2 years ago
- jQuery, React and Streamlit applications written by LLMs☆15Dec 24, 2023Updated 2 years ago
- Easily create LLM automation/agent workflows☆60Feb 13, 2024Updated 2 years ago
- This package provides Swift bindings for llama.cpp☆26Apr 4, 2023Updated 3 years ago
- Ehnd 의 웹 번역 버전입니다.☆10Feb 7, 2022Updated 4 years ago