LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
☆133Jun 10, 2023Updated 2 years ago
Alternatives and similar repositories for llama-server
Users that are interested in llama-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python bindings for llama.cpp☆68Feb 29, 2024Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 5 months ago
- IRIS: Demonstrator for use of LLMs in python (outdated)☆62Mar 23, 2025Updated last year
- ☆17Apr 24, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…☆594Jun 12, 2023Updated 2 years ago
- Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)☆12Aug 18, 2021Updated 4 years ago
- Python scripts for AI voice changers☆14Apr 25, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- Structural Pruning for LLaMA☆54May 20, 2023Updated 2 years ago
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g…☆22Nov 10, 2025Updated 5 months ago
- A tool for adding function calling to llm api, available as a service by following the link☆22Aug 11, 2025Updated 8 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.☆11May 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An embeddable widget for interacting with openAI api compatable LLM's☆14Sep 18, 2024Updated last year
- Plugin Allows loading of local llms into Auto-GPT☆12Apr 21, 2023Updated 2 years ago
- HuggingChat like UI in Gradio☆69May 23, 2023Updated 2 years ago
- Chat²GPT is a ChatGPT (and DALL·E 2/3, and ElevenLabs) chat bot for Google Chat. 🤖💬☆11Feb 2, 2026Updated 2 months ago
- ☆16Mar 11, 2025Updated last year
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- Documentation site for fast-agent☆29Updated this week
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- TypeScript 컴시간알리미 파서☆14Mar 31, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Claude Project Coordinator is a Swift-powered MCP (Model Context Protocol) server designed to streamline multi-project Xcode development.…☆45Jul 4, 2025Updated 9 months ago
- A Neural Audio Codec (NAC) for Universal Audio