This repository deploys and manages server processes that serve embeddings with the Infinity Embedding model or Large Language Models through an OpenAI-compatible vLLM server.
☆26 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for llm-hosting
Users who are interested in llm-hosting are comparing it to the libraries listed below.
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara ☆16 · Aug 3, 2023 · Updated 2 years ago
- Project shadows from sprites on transparent backgrounds ☆13 · Aug 14, 2025 · Updated 8 months ago
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources. ☆17 · May 7, 2024 · Updated last year
- Agent based market simulation ☆15 · Aug 10, 2024 · Updated last year
- Git scrapers for scraping the fediverse ☆21 · Updated this week
- Simple Pygame application for Particle Effect showcasing (Tutorial) ☆10 · Nov 11, 2023 · Updated 2 years ago
- ☆14 · Dec 21, 2025 · Updated 4 months ago
- A Datasette plugin for making data visualizations with Observable Plot ☆26 · Oct 21, 2025 · Updated 6 months ago
- Interface for interacting with Gradient AI in Python ☆15 · Jun 28, 2024 · Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included ☆17 · Oct 2, 2024 · Updated last year
- PostgreSQL Lance Table Extension ☆25 · Dec 27, 2025 · Updated 4 months ago
- Pin files for contextual, codebase-level AI assistance. ☆16 · Jul 11, 2024 · Updated last year
- A utility for async batch jobs in marimo ☆13 · Mar 12, 2025 · Updated last year
- marimo + pixi starter template ☆18 · Jan 31, 2025 · Updated last year
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB ☆16 · Feb 21, 2026 · Updated 2 months ago
- Tui Utility to test REST APIs ☆13 · Nov 20, 2023 · Updated 2 years ago
- Wrap-up around RinteRface templates ☆11 · Apr 10, 2019 · Updated 7 years ago
- ☆15 · Oct 19, 2024 · Updated last year
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming ☆11 · Nov 18, 2024 · Updated last year
- Tools to collect detailed usage analytics of Idyll articles. ☆15 · Nov 18, 2024 · Updated last year
- ☆37 · Mar 5, 2026 · Updated 2 months ago
- Code for building self-expanding knowledge graphs with Outlines, vLLM, neo4j, and Modal. ☆37 · May 14, 2025 · Updated 11 months ago
- 🖼 A minimal R client for interacting with Instagram’s public API ☆14 · Aug 30, 2018 · Updated 7 years ago
- This is a package to implement the Robust Latent Dirichlet Approach in R. ☆10 · Apr 25, 2019 · Updated 7 years ago
- Enhanced note taking for AI Agents with supervision. ☆39 · Nov 24, 2025 · Updated 5 months ago
- Golang SDK for Truss ☆40 · Apr 8, 2026 · Updated 3 weeks ago
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS) ☆17 · Mar 23, 2026 · Updated last month
- ☆80 · Jun 5, 2024 · Updated last year
- A best-of list for all awesome projects written in textual ☆19 · Jun 6, 2024 · Updated last year
- MoodCat😼 classifies the mood of English sentences. ☆14 · Jun 19, 2022 · Updated 3 years ago
- 🐧🐦 Generate HTML pages for Twitter statuses. ☆14 · Jul 22, 2018 · Updated 7 years ago
- A spatial-temporal map of the whole human history backed by a small SQLite db in browser. ☆21 · Jul 7, 2025 · Updated 9 months ago
- Anthropic Computer Use with Modal Sandboxes ☆49 · Oct 23, 2024 · Updated last year
- Intentional is an open-source framework to build reliable LLM chatbots that actually talk and behave as you expect. ☆13 · Dec 31, 2024 · Updated last year
- An OCR labeling tool - made for docTR ☆21 · Apr 23, 2026 · Updated last week
- Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs" ☆17 · Jul 1, 2024 · Updated last year
- REST API for Large Language Models using FastAPI, Redis and LiteLLM ☆14 · Nov 30, 2023 · Updated 2 years ago
- ☆15 · Dec 2, 2019 · Updated 6 years ago
- Minimalistic Go statusline for Claude Code ☆37 · Apr 25, 2026 · Updated last week