This repository deploys and manages server processes that either serve embeddings with the Infinity Embedding model or serve Large Language Models through an OpenAI-compatible vLLM server.
☆26 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below.
- A public repo that contains integrations for Argilla and LlamaIndex. ☆17 · Oct 10, 2024 · Updated last year
- Project shadows from sprites on transparent backgrounds ☆13 · Aug 14, 2025 · Updated 10 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php. ☆187 · Dec 29, 2025 · Updated 5 months ago
- Plug-and-play document AI with zero-shot models. ☆126 · May 11, 2026 · Updated last month
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources. ☆17 · May 7, 2024 · Updated 2 years ago
- Git scrapers for scraping the fediverse ☆22 · Updated this week
- ☆13 · Feb 22, 2024 · Updated 2 years ago
- Simple Pygame application for Particle Effect showcasing (Tutorial) ☆10 · Nov 11, 2023 · Updated 2 years ago
- Manage ML configuration with pydantic ☆16 · Mar 18, 2026 · Updated 2 months ago
- ☆14 · Dec 21, 2025 · Updated 5 months ago
- Interface for interacting with Gradient AI in Python ☆15 · Jun 28, 2024 · Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included ☆17 · Oct 2, 2024 · Updated last year
- A Framework For Intelligence Farming ☆16 · Apr 3, 2025 · Updated last year
- Pin files for contextual, codebase-level AI assistance. ☆16 · Jul 11, 2024 · Updated last year
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab… ☆31 · Nov 21, 2025 · Updated 6 months ago
- Wrap-up around RinteRface templates ☆11 · Apr 10, 2019 · Updated 7 years ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps. ☆17 · Nov 7, 2022 · Updated 3 years ago
- ☆15 · Oct 19, 2024 · Updated last year
- Semantic emoji finder. Python/dash UI. Uses sentence transformer embeddings and duckdb ☆20 · Sep 15, 2025 · Updated 8 months ago
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming ☆11 · Nov 18, 2024 · Updated last year
- This is a question-output workflow template for shiny app! ☆12 · May 17, 2019 · Updated 7 years ago
- TUI command launcher 🚀 ☆31 · May 15, 2026 · Updated 3 weeks ago
- Hugging Face RoBERTa with Flash Attention 2 ☆24 · Sep 14, 2025 · Updated 9 months ago
- Modal LLM llama.cpp-based model deployment as part of a series on Model as a Service (MaaS) ☆17 · Mar 23, 2026 · Updated 2 months ago
- ☆79 · Jun 5, 2024 · Updated 2 years ago
- Code to create bugged python scripts for OpenAssistant Training, maintained by https://twitter.com/Cyndesama ☆24 · Jul 23, 2023 · Updated 2 years ago
- Anthropic Computer Use with Modal Sandboxes ☆49 · Oct 23, 2024 · Updated last year
- ClaudeWatch - A tool to track active Claude sessions with Warp ☆78 · Mar 23, 2026 · Updated 2 months ago
- Declarative AI Pipelines ☆22 · Oct 2, 2024 · Updated last year
- REST API for Large Language Models using FastAPI, Redis and LiteLLM ☆14 · Nov 30, 2023 · Updated 2 years ago
- MCP Server for QA Sphere TMS ☆22 · Updated this week
- Local FAISS vector store as an MCP server – Agent Memory, drop-in local semantic search for Claude / Copilot / Agents. ☆30 · Apr 24, 2026 · Updated last month
- A Collection of Pydantic Models to Abstract IRL ☆41 · Dec 10, 2025 · Updated 6 months ago
- ☆34 · May 15, 2026 · Updated 3 weeks ago
- 🧇 Retrieves location closure info for Waffle House and computes the Waffle House Index (% of locations closed) ☆11 · Mar 26, 2020 · Updated 6 years ago
- Machinery data, made easy. Easily download and prepare common industrial datasets. ☆23 · Feb 13, 2024 · Updated 2 years ago
- A command-line tool for creating and managing external HITs on Amazon's Mechanical Turk ☆15 · Jan 11, 2021 · Updated 5 years ago
- Beat for traceroute command ☆14 · Oct 5, 2018 · Updated 7 years ago
- Slides and materials for my talk to the Madison R Users Group (September 21st, 2016) ☆15 · Oct 17, 2016 · Updated 9 years ago