nuance1979/llama-server

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nuance1979/llama-server)

nuance1979 / llama-server

LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.

☆135

Alternatives and similar repositories for llama-server

Users that are interested in llama-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

absadiki / pyllamacpp
View on GitHub
Python bindings for llama.cpp
☆68Feb 29, 2024Updated 2 years ago
OpenAccess-AI-Collective / ggml-webui
View on GitHub
Deploy your GGML models to HuggingFace Spaces with Docker and gradio
☆38Jun 6, 2023Updated 3 years ago
cmp-nct / ggllm.cpp
View on GitHub
Falcon LLM ggml framework with CPU and GPU support
☆249Jul 2, 2026Updated last week
dkjroot / iris-llm
View on GitHub
IRIS: Demonstrator for use of LLMs in python (outdated)
☆62Mar 23, 2025Updated last year
serp-ai / unsloth
View on GitHub
5X faster 60% less memory QLoRA finetuning
☆21May 28, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
keldenl / gpt-llama.cpp
View on GitHub
A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI…
☆592Jun 12, 2023Updated 3 years ago
serp-ai / Parameter-Efficient-MoE
View on GitHub
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31May 22, 2024Updated 2 years ago
horseee / LLaMA-Pruning
View on GitHub
Structural Pruning for LLaMA
☆54May 20, 2023Updated 3 years ago
jefftriplett / hubcap
View on GitHub
Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust. *This is the Python fork/port* from https://g…
☆22Nov 10, 2025Updated 8 months ago
garysharp / EduHub.Data
View on GitHub
Provides strongly-typed object model for eduHub Data Sets
☆13Mar 11, 2026Updated 4 months ago
ejones / llama-journey
View on GitHub
Experimental adventure game with AI-generated content
☆111Apr 15, 2025Updated last year
helixml / chat-widget
View on GitHub
An embeddable widget for interacting with openAI api compatable LLM's
☆15Sep 18, 2024Updated last year
jploski / ggml
View on GitHub
Falcon7B + Falcon40B support - in branch falcon40b. Now all good and working. But main action now in https://github.com/cmp-nct/ggllm.cpp
☆10Sep 30, 2023Updated 2 years ago
OpenAccess-AI-Collective / FastChat
View on GitHub
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
☆11May 26, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
handplant / craftcms-lazy-starter-kit
View on GitHub
A modern Craft CMS starter kit for agencies and developers — featuring Vite, Tailwind, Datastar, DDEV, MCP, LLM Ready.
☆38Jun 24, 2026Updated 2 weeks ago
orkuhh / Auto-GPT-localLLMs
View on GitHub
Plugin Allows loading of local llms into Auto-GPT
☆12Apr 21, 2023Updated 3 years ago
iaalm / llama-api-server
View on GitHub
A OpenAI API compatible REST server for llama.
☆207Feb 24, 2025Updated last year
deep-diver / gradio-chat
View on GitHub
HuggingChat like UI in Gradio
☆69May 23, 2023Updated 3 years ago
derenlei / FactCG
View on GitHub
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data (NAACL 2025)
☆17Jul 14, 2025Updated last year
rhos-ai / IPR-1
View on GitHub
Official Repo of IPR-1 Project. https://www.rhos.ai/research/ipr-1
☆30Jun 29, 2026Updated 2 weeks ago
RAHB-REALTORS-Association / chat2gpt
View on GitHub
Chat²GPT is a ChatGPT (and DALL·E 2/3, and ElevenLabs) chat bot for Google Chat. 🤖💬
☆11Feb 2, 2026Updated 5 months ago
evalstate / fast-agent-docs
View on GitHub
Documentation site for fast-agent
☆31May 10, 2026Updated 2 months ago
binomed / sphero_ollie-web-bluetooth
View on GitHub
Control a Sphero Ollie with web bluetooth
☆13Nov 7, 2016Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
abgulati / hf-waitress
View on GitHub
Serving LLMs in the HF-Transformers format via a PyFlask API
☆72Sep 10, 2024Updated last year
IDEA-Emdoor-Lab / DistilCodec
View on GitHub
A Neural Audio Codec (NAC) for Universal Audio
☆46May 30, 2025Updated last year
Zyphra / transformers_zamba2
View on GitHub
☆49Feb 5, 2025Updated last year
ggml-org / p1
View on GitHub
LLM-based code completion engine
☆194Jan 23, 2025Updated last year
A-M-D-R-3-W / llmFunctionDecorator
View on GitHub
A Python package designed to simplify the process of creating and managing function calls to OpenAI's API, as well as models using LiteLL…
☆17May 25, 2025Updated last year
msminhas93 / ferric-micrograd
View on GitHub
A rust implementation of Andrej Karpathy's Micrograd
☆15Apr 28, 2025Updated last year
Tineyo / BoltAPP
View on GitHub
Web app for Sphero Bolt
☆14Sep 28, 2019Updated 6 years ago
Guy-Markman / PyHoot
View on GitHub
Kahoot clone based on Python, final project for Gvahim
☆12Jun 4, 2017Updated 9 years ago
vtuber-plan / langport
View on GitHub
Langport is a language model inference service
☆94Sep 9, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
abetlen / llama-cpp-python
View on GitHub
Python bindings for llama.cpp
☆10,485Updated this week
deerzq / Unsupervised-multi-metric-fusion-for-FR-IQA
View on GitHub
Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)
☆11Jul 11, 2014Updated 12 years ago
somewheresystems / llama2mlx
View on GitHub
Karpathy's llama2.c transpiled to MLX for Apple Silicon
☆14Dec 28, 2023Updated 2 years ago
rafaelpierre / larry-ai
View on GitHub
larry.ai: A Batteries Included ChatGPT Frontend Framework & HTTP Proxy
☆17Jan 16, 2024Updated 2 years ago
pixelomer / DeArrow-iOS
View on GitHub
☆16Sep 15, 2024Updated last year
harentius / GrammifyAI
View on GitHub
Minimalist LLM Grammar Checker for macOS
☆21Feb 22, 2026Updated 4 months ago
securade / sentinel
View on GitHub
Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.
☆30Apr 6, 2025Updated last year