An OpenAI-like LLaMA inference API
☆113Sep 17, 2023Updated 2 years ago
Alternatives and similar repositories for llama-api
Users that are interested in llama-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM Chat is an open-source serverless alternative to ChatGPT.☆36Sep 13, 2024Updated last year
- StrategyQA 데이터 세트 번역☆22Apr 12, 2024Updated 2 years ago
- The accompany backend for PAI app☆12Mar 24, 2025Updated last year
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- ☆19Jul 23, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,924Sep 30, 2023Updated 2 years ago
- ☆136May 26, 2026Updated last month
- Personnal collection of pipes and filters I use for open-webui☆27Apr 15, 2026Updated 2 months ago
- ☆16Dec 16, 2024Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆100Apr 2, 2026Updated 2 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Dec 8, 2023Updated 2 years ago
- An omnipowerful personal assistant powered by LLMs, Zapier NLA, and custom actions.☆15Sep 13, 2024Updated last year
- VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference)☆36Feb 28, 2024Updated 2 years ago
- Vpin caculation and backtesting☆14Aug 16, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- A simple extension that allows LLM to speak in any voice, literally, based on Sliero TTS which is available in oobabooga's textgen-webui …☆12Aug 26, 2023Updated 2 years ago
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API☆24Aug 1, 2024Updated last year
- ☆12Apr 28, 2023Updated 3 years ago
- Oobabooga extension for Bark TTS☆119Nov 23, 2023Updated 2 years ago
- The source code of the game I made for the HuggingFace game jam☆16Jul 25, 2023Updated 2 years ago
- Get more done with LLMs☆13Jan 19, 2024Updated 2 years ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Aug 18, 2024Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Oct 2, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- NREPL for Hy☆12Aug 8, 2025Updated 10 months ago
- ☆40Aug 26, 2025Updated 10 months ago
- OpenOrca-KO dataset을 활용하여 llama2를 fine-tuning한 Korean-OpenOrca☆18Nov 1, 2023Updated 2 years ago
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.☆17Apr 8, 2026Updated 2 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆157Nov 21, 2023Updated 2 years ago
- jQuery, React and Streamlit applications written by LLMs☆15Dec 24, 2023Updated 2 years ago
- Easily create LLM automation/agent workflows☆59Feb 13, 2024Updated 2 years ago
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu…☆26Aug 16, 2024Updated last year
- Web UI for ExLlamaV2☆513Feb 5, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆368Dec 9, 2023Updated 2 years ago
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆17Jul 15, 2023Updated 2 years ago
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,261Updated this week
- Discord chatbot interface to train an LLM on user message history☆27Jun 9, 2023Updated 3 years ago
- Data Analysis, Analytics, Science, AI & ML, LLM etc.☆15Jun 6, 2025Updated last year
- LLM shell and document interogator☆14Jul 24, 2023Updated 2 years ago
- A full-stack Webui implementation of Large Language model, such as ChatGPT or LLaMA.☆290Jul 25, 2024Updated last year