LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
☆135Jun 10, 2023Updated 2 years ago
Alternatives and similar repositories for llama-server
Users that are interested in llama-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python bindings for llama.cpp☆68Feb 29, 2024Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 6 months ago
- Falcon LLM ggml framework with CPU and GPU support☆250Jan 22, 2024Updated 2 years ago
- IRIS: Demonstrator for use of LLMs in python (outdated)☆62Mar 23, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated 2 years ago
- A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…☆595Jun 12, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated 2 years ago
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g…☆22Nov 10, 2025Updated 6 months ago
- Experimental adventure game with AI-generated content☆111Apr 15, 2025Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.☆11May 26, 2023Updated 3 years ago
- An embeddable widget for interacting with openAI api compatable LLM's☆15Sep 18, 2024Updated last year
- ☆16May 31, 2024Updated 2 years ago
- HuggingChat like UI in Gradio☆69May 23, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Chat²GPT is a ChatGPT (and DALL·E 2/3, and ElevenLabs) chat bot for Google Chat. 🤖💬☆11Feb 2, 2026Updated 3 months ago
- ☆16Mar 11, 2025Updated last year
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- Detect and identify different species of harmful algae within natural water in real-time with AI and a camera (i.e., ESP32-CAM, smartphon…☆15Apr 30, 2026Updated last month
- Claude Project Coordinator is a Swift-powered MCP (Model Context Protocol) server designed to streamline multi-project Xcode development.…☆46Apr 25, 2026Updated last month
- A Neural Audio Codec (NAC) for Universal Audio☆46May 30, 2025Updated last year
- ☆52Feb 5, 2025Updated last year
- Realtime News and Information Eval☆19Mar 26, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 中文原生工业测评基准☆16Mar 21, 2024Updated 2 years ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆14Dec 28, 2023Updated 2 years ago
- Python bindings for llama.cpp☆10,344May 24, 2026Updated last week
- larry.ai: A Batteries Included ChatGPT Frontend Framework & HTTP Proxy☆17Jan 16, 2024Updated 2 years ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆48Jul 18, 2025Updated 10 months ago
- private-machine is an AI companion system with emotion, needs and goals simulation. Very silly, not based on real science.☆35Apr 5, 2026Updated last month
- ☆22Sep 4, 2023Updated 2 years ago
- Context Strategy Framework: Intent → Build → Learn workflow. Preserves context in AI-generated code.☆38Jan 27, 2026Updated 4 months ago
- PostgreSQL SKILLs for AI Agent☆35Feb 5, 2026Updated 3 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12May 14, 2026Updated 2 weeks ago
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆40Jun 4, 2025Updated 11 months ago
- Embedding models from Jina AI☆66Jan 18, 2024Updated 2 years ago
- This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).☆47Feb 8, 2026Updated 3 months ago
- Personalized all-purpose AI assistance platform based on hierarchical cooperative multi-agent framework which utilizes websocket connecti…☆38Aug 11, 2024Updated last year
- Browser-native cloud OS - Unix environment running entirely in the browser via WebAssembly and IndexedDB☆41Mar 19, 2026Updated 2 months ago
- Implementation of "CBCT-Dental Scan Registration via Descriptor Representation Learning"☆15Jan 8, 2024Updated 2 years ago