An OpenAI-like LLaMA inference API
☆113Sep 17, 2023Updated 2 years ago
Alternatives and similar repositories for llama-api
Users that are interested in llama-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM Chat is an open-source serverless alternative to ChatGPT.☆36Sep 13, 2024Updated last year
- An OpenAI API compatible images server to generate or manipulate images.☆18Feb 2, 2025Updated last year
- The accompany backend for PAI app☆12Mar 24, 2025Updated last year
- ☆15May 20, 2023Updated 3 years ago
- A Python package designed to simplify the process of creating and managing function calls to OpenAI's API, as well as models using LiteLL…☆17May 25, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,922Sep 30, 2023Updated 2 years ago
- ☆136May 26, 2026Updated 2 weeks ago
- Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Tra…☆1,287Jan 24, 2024Updated 2 years ago
- Personnal collection of pipes and filters I use for open-webui☆27Apr 15, 2026Updated last month
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- ☆16Dec 16, 2024Updated last year
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Dec 8, 2023Updated 2 years ago
- Wheels for llama-cpp-python compiled with cuBLAS support☆103Feb 1, 2024Updated 2 years ago
- An omnipowerful personal assistant powered by LLMs, Zapier NLA, and custom actions.☆15Sep 13, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆266Mar 6, 2025Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 4 months ago
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API☆24Aug 1, 2024Updated last year
- ☆12Apr 28, 2023Updated 3 years ago
- The source code of the game I made for the HuggingFace game jam☆16Jul 25, 2023Updated 2 years ago
- Get more done with LLMs☆13Jan 19, 2024Updated 2 years ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Aug 18, 2024Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Oct 2, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.☆17Apr 8, 2026Updated 2 months ago
- An easy-to-use scikit-learn inspired implementation of the Multidimensional Multiclass Genetic Programming with Multidimensional Populati…☆11Jun 2, 2026Updated last week
- Prototype web interface that enables remote teleoperation of the Stretch RE1 mobile manipulator from Hello Robot Inc.☆12Dec 14, 2023Updated 2 years ago
- Run any Large Language Model behind a unified API☆169Nov 13, 2023Updated 2 years ago
- jQuery, React and Streamlit applications written by LLMs☆16Dec 24, 2023Updated 2 years ago
- Easily create LLM automation/agent workflows☆60Feb 13, 2024Updated 2 years ago
- This package provides Swift bindings for llama.cpp☆26Apr 4, 2023Updated 3 years ago
- Ehnd 의 웹 번역 버전입니다.☆10Feb 7, 2022Updated 4 years ago
- A frontend for creative writing with LLMs☆161Jul 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Web UI for ExLlamaV2☆512Feb 5, 2025Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆368Dec 9, 2023Updated 2 years ago
- ☆12May 20, 2025Updated last year
- Discord chatbot interface to train an LLM on user message history☆27Jun 9, 2023Updated 3 years ago
- Data Analysis, Analytics, Science, AI & ML, LLM etc.☆15Jun 6, 2025Updated last year
- LLM shell and document interogator☆14Jul 24, 2023Updated 2 years ago
- A full-stack Webui implementation of Large Language model, such as ChatGPT or LLaMA.☆290Jul 25, 2024Updated last year