abgulati / hf-waitress
Serving LLMs in the HF-Transformers format via a PyFlask API
☆68Updated 3 months ago
Alternatives and similar repositories for hf-waitress:
Users that are interested in hf-waitress are comparing it to the libraries listed below
- idea: https://github.com/nyxkrage/ebook-groupchat/☆83Updated 3 months ago
- Easily view and modify JSON datasets for large language models☆65Updated 2 months ago
- ☆26Updated 2 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆96Updated 5 months ago
- Complex RAG backend☆28Updated 8 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆59Updated 2 months ago
- ☆117Updated this week
- On-demand model switching with llama.cpp (or other OpenAI compatible backends)☆109Updated this week
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆39Updated 2 weeks ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 5 months ago
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆36Updated last month
- A framework that uses multi-agents to enable users to perform a systematic data science pipeline with just two inputs.☆36Updated 4 months ago
- A frontend for creative writing with LLMs☆112Updated 5 months ago
- Integrates AI tools into Microsoft® Word® (independently developed, not affiliated with Microsoft)☆77Updated last week
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of…☆48Updated 2 months ago
- run ollama & gguf easily with a single command☆48Updated 7 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆98Updated last month
- Embed anything.☆28Updated 6 months ago
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intellig…☆44Updated 4 months ago
- Something similar to Apple Intelligence?☆58Updated 5 months ago
- A discovery and compression tool for your Python codebase. Creates a knowledge graph for a LLM context window, efficiently outlining your…☆68Updated last week
- ☆30Updated 7 months ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files☆133Updated last month
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆30Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 7 months ago
- ☆18Updated last week
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆60Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆24Updated last week
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆62Updated last month
- A Python library to orchestrate LLMs in a neural network-inspired structure☆44Updated 2 months ago