This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
☆26Mar 6, 2025Updated last year
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara☆16Aug 3, 2023Updated 2 years ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆184Dec 29, 2025Updated 3 months ago
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆17May 7, 2024Updated last year
- Git scrapers for scraping the fediverse☆20Updated this week
- ☆12Feb 22, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple Pygame application for Particle Effect showcasing (Tutorial)☆10Nov 11, 2023Updated 2 years ago
- Manage ML configuration with pydantic☆16Mar 18, 2026Updated 3 weeks ago
- ☆14Dec 21, 2025Updated 3 months ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆29Nov 21, 2025Updated 4 months ago
- Interface for interacting with Gradient AI in Python☆15Jun 28, 2024Updated last year
- A Framework For Intelligence Farming☆16Apr 3, 2025Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- Tui Utility to test REST APIs☆13Nov 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tutorials, templates for running glassflow pipelines☆30Feb 12, 2025Updated last year
- Wrap-up around RinteRface templates☆11Apr 10, 2019Updated 7 years ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆17Nov 7, 2022Updated 3 years ago
- ☆15Oct 19, 2024Updated last year
- Obsidian plugin that allows to display contents of Arc sidebar right besides your notes☆14Jan 26, 2024Updated 2 years ago
- Tools to collect detailed usage analytics of Idyll articles.☆15Nov 18, 2024Updated last year
- This is a package to implement the Robust Latent Dirichlet Approach in R.☆10Apr 25, 2019Updated 6 years ago
- Human-driven multi-agent dashboard for Claude Code, Codex, Gemini & OpenCode. Web UI, project wiki, shared content, and MCP-based agent…☆30Updated this week
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆80Jun 5, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- ☆21Apr 9, 2025Updated last year
- A spatial-temporal map of the whole human history backed by a small SQLite db in browser.☆21Jul 7, 2025Updated 9 months ago
- [ACL 2022] CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations☆10Jun 5, 2022Updated 3 years ago
- Anthropic Computer Use with Modal Sandboxes☆47Oct 23, 2024Updated last year
- A Collection of Pydantic Models to Abstract IRL☆39Dec 10, 2025Updated 4 months ago
- ☆23Updated this week
- Materials for the "Apps and Dashboards with Shiny " workshop at WSDS 2018☆19Jun 5, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Declarative AI Pipelines☆22Oct 2, 2024Updated last year
- Local FAISS vector store as an MCP server – Agent Memory, drop-in local semantic search for Claude / Copilot / Agents.☆26Mar 8, 2026Updated last month
- ☆15Dec 2, 2019Updated 6 years ago
- A rolling version of the Latent Dirichlet Allocation.☆13Nov 27, 2023Updated 2 years ago
- 🧇 Retrieves location closure info for Waffle House and computes the Waffle House Index (% of locations closed)☆11Mar 26, 2020Updated 6 years ago
- Machinery data, made easy. Easily download and prepare common industrial datasets.☆23Feb 13, 2024Updated 2 years ago
- A command-line tool for creating and managing external HITs on Amazon's Mechanical Turk☆15Jan 11, 2021Updated 5 years ago