A collection of LLM services you can self host via docker or modal labs to support your applications development
☆201Apr 29, 2024Updated 2 years ago
Alternatives and similar repositories for fastllm
Users that are interested in fastllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated 3 weeks ago
- automatic sentence highlights based on their significance to the document☆196Nov 22, 2023Updated 2 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- A strongly typed Python DSL for developing message passing multi agent systems☆54Apr 9, 2024Updated 2 years ago
- ☆16Mar 23, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- structured outputs for llms☆12,889Apr 22, 2026Updated 2 weeks ago
- The Aesir Programming Language☆12Nov 5, 2023Updated 2 years ago
- A collection of tools for your LLMs that run on Modal☆26Feb 28, 2025Updated last year
- ☆22Oct 14, 2024Updated last year
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Sep 29, 2024Updated last year
- an ambient intelligence library☆6,146Apr 22, 2026Updated 2 weeks ago
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆17Mar 23, 2026Updated last month
- Apps that run on modal.com☆13Sep 14, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆38May 14, 2024Updated last year
- A library to use `modal` as a backend for `joblib`.☆32Jan 15, 2025Updated last year
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago
- Deploy a FastHTML app in just a few lines of simple python code on Modal's serverless infra.☆26Aug 19, 2024Updated last year
- Send email using markdown☆54Apr 28, 2026Updated last week
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- ☆195Apr 15, 2026Updated 3 weeks ago
- ☆230Jan 18, 2026Updated 3 months ago
- run embeddings in MLX☆98Sep 27, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 7 months ago
- A public release of TimelineBuilder for building personal digital data timelines.☆370Sep 3, 2024Updated last year
- A program synthesis agent that autonomously fixes its output by running tests!☆468Sep 19, 2024Updated last year
- Application to take in video urls and stream either transcripts or tokens.☆76Aug 28, 2023Updated 2 years ago
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.☆104Sep 9, 2023Updated 2 years ago
- A Python wrapper for the bbhash library for Minimal Perfect Hashing☆19Oct 26, 2025Updated 6 months ago
- ☆15Apr 26, 2025Updated last year
- ☆15Sep 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Demo of ConversationEntityMemory in Streamlit.☆51Jan 23, 2023Updated 3 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,918May 17, 2025Updated 11 months ago
- 👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services.☆364Jan 30, 2025Updated last year
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Sep 21, 2023Updated 2 years ago
- Live demo of shot-scraper☆41Mar 2, 2025Updated last year
- ⛓️ build cognitive systems, pythonic☆340Nov 19, 2024Updated last year
- ☆56Apr 2, 2026Updated last month