abgulati/hf-waitress

Serving LLMs in the HF-Transformers format via a PyFlask API

☆72

Alternatives and similar repositories for hf-waitress

Users that are interested in hf-waitress are comparing it to the libraries listed below.

znfgnu / easy-agent
Simple agent framework using Ollama tool calling
☆10 · Aug 27, 2024 · Updated last year
fishiatee / Tumera
Yet another frontend for LLM, written using .NET and WinUI 3
☆11 · Sep 14, 2025 · Updated 10 months ago
remichu-ai / gallama
☆137 · Jun 30, 2026 · Updated 3 weeks ago
calmstate / Itinerant
A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.
☆19 · Aug 30, 2024 · Updated last year
abgulati / LARS
An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.
☆637 · Oct 29, 2024 · Updated last year
truemagic-coder / nemo-agent
Your Python AI Coder!
☆36 · May 21, 2025 · Updated last year
Danmoreng / llm-pen
☆16 · Feb 21, 2026 · Updated 5 months ago
bdytx5 / open_answer_engine
☆23 · Aug 9, 2024 · Updated last year
vincentdnl / operativeai
☆26 · May 31, 2024 · Updated 2 years ago
l33tkr3w / LlamaCards
LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …
☆36 · Aug 28, 2024 · Updated last year
julianthomas04 / Nova1
An Open-Source Modular AI Assistant
☆32 · Mar 20, 2025 · Updated last year
RobotTelevision / CrowAssistant
Crow is a Desktop AI Assistant
☆33 · Aug 9, 2024 · Updated last year
shoibloya / nuggt-research
☆21 · Jan 25, 2025 · Updated last year
fishiatee / yawullm
Yet Another (LLM) Web UI, made with Gemini
☆12 · Dec 25, 2024 · Updated last year
mpazdzioch / llamacpp-webui-glue
☆36 · Aug 21, 2025 · Updated 11 months ago
wavify-labs / wavify-sdks
fast state-of-the-art speech models and a runtime that runs anywhere 💥
☆58 · Feb 10, 2026 · Updated 5 months ago
zenoverflow / omnichain
Efficient visual programming for AI language models
☆357 · May 13, 2025 · Updated last year
MatN23 / AdaptiveTrainingSystem
A PyTorch framework for training transformer language models with Mixture of Experts (MoE) architecture support, Mixture of Depths (MoD),…
☆21 · Updated this week
intelligencedev / eternal
Eternal is an experimental platform for machine learning models and workflows.
☆70 · Mar 9, 2025 · Updated last year
leafspark / AutoGGUF
automatically quant GGUF models
☆226 · Dec 23, 2025 · Updated 7 months ago
calmstate / polyglot
Polyglot is a fast, elegant, and free translation tool using AI.
☆66 · Nov 21, 2025 · Updated 8 months ago
PDBeurope / protvista-pdb
PDB ProtVista Viewer
☆11 · Updated this week
PasiKoodaa / Chat-with-Screen
☆20 · Sep 28, 2024 · Updated last year
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆69 · Aug 21, 2024 · Updated last year
Electrofried / Astarte
☆11 · Feb 20, 2025 · Updated last year
IST-DASLab / gptq-gguf-toolkit
Efficient non-uniform quantization with GPTQ for GGUF
☆64 · Sep 17, 2025 · Updated 10 months ago
lynthera / bitsegments_localminds
Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.
☆22 · Jan 10, 2026 · Updated 6 months ago
dominikhei / terraform-provider-ecr-build-push-image
Terraform provider to build Docker images and push them to AWS ECR
☆10 · May 24, 2025 · Updated last year
cp3249 / splaa
SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…
☆29 · May 6, 2025 · Updated last year
lucasavila00 / LmScript
Controllable Language Model Interactions in TypeScript
☆10 · May 17, 2024 · Updated 2 years ago
PerminovEugene / messy-folder-reorganizer-ai
🤖 AI-powered CLI for file reorganization. Runs fully locally — no data leaves your machine.
☆20 · Jul 2, 2025 · Updated last year
lynxai-team / agent-smith
Local first human friendly agents toolkit for the browser and Nodejs
☆45 · Updated this week
LAION-AI / Desktop_BUD-E
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆44 · Jul 18, 2024 · Updated 2 years ago
astramind-ai / Pulsar
The heart of The Pulsar App, fast, secure and shared inference with modern UI
☆58 · Dec 1, 2024 · Updated last year
juanmackie / ccswap
A simple, cross-platform CLI tool for quickly switching between Claude Code configuration profiles by managing different settings.json ve…
☆16 · Jan 17, 2026 · Updated 6 months ago
v2rockets / Loyal-Elephie
Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible
☆353 · Feb 28, 2025 · Updated last year
TC-Zheng / ActuosusAI
AI management tool
☆119 · Nov 9, 2024 · Updated last year
SomeOddCodeGuy / OfflineWikipediaTextApi
This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …
☆106 · May 31, 2026 · Updated last month
qingy1337 / xplore-terminallm
Allows two LLMs to communicate and run code in the terminal
☆28 · Dec 8, 2024 · Updated last year