A collection of LLM services you can self-host via Docker or Modal Labs to support your application development
☆201 · Apr 29, 2024 · Updated last year
Alternatives and similar repositories for fastllm
Users that are interested in fastllm are comparing it to the libraries listed below.
- Using modal.com to process FineWeb-edu data ☆20 · Apr 5, 2025 · Updated 11 months ago
- Utility to use ElevenLabs streaming in the command line ☆11 · Aug 8, 2023 · Updated 2 years ago
- automatic sentence highlights based on their significance to the document ☆197 · Nov 22, 2023 · Updated 2 years ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Feb 6, 2023 · Updated 3 years ago
- A strongly typed Python DSL for developing message passing multi agent systems ☆53 · Apr 9, 2024 · Updated last year
- ☆16 · Mar 23, 2023 · Updated 3 years ago
- structured outputs for llms ☆12,589 · Updated this week
- The Aesir Programming Language ☆12 · Nov 5, 2023 · Updated 2 years ago
- A collection of tools for your LLMs that run on Modal ☆23 · Feb 28, 2025 · Updated last year
- ☆22 · Oct 14, 2024 · Updated last year
- Seamless Voice Interactions with LLMs ☆12 · Oct 28, 2023 · Updated 2 years ago
- Get a markdown version of any webpage with a keyboard shortcut. ☆66 · Feb 17, 2025 · Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆51 · Sep 29, 2024 · Updated last year
- an ambient intelligence library ☆6,100 · Mar 18, 2026 · Updated last week
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 · May 14, 2024 · Updated last year
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS) ☆17 · Feb 27, 2026 · Updated 3 weeks ago
- Apps that run on modal.com ☆13 · Sep 14, 2025 · Updated 6 months ago
- A library to use `modal` as a backend for `joblib`. ☆32 · Jan 15, 2025 · Updated last year
- A webhook that integrates the W&B model registry with Modal Labs ☆15 · Dec 24, 2023 · Updated 2 years ago
- Deploy a FastHTML app in just a few lines of simple python code on Modal's serverless infra. ☆26 · Aug 19, 2024 · Updated last year
- Send email using markdown ☆54 · Feb 21, 2026 · Updated last month
- ☆187 · Feb 2, 2026 · Updated last month
- A sample pattern for running CI tests on Modal ☆19 · Apr 12, 2025 · Updated 11 months ago
- ☆224 · Jan 18, 2026 · Updated 2 months ago
- run embeddings in MLX ☆97 · Sep 27, 2024 · Updated last year
- Full finetuning of large language models without large memory requirements ☆94 · Sep 22, 2025 · Updated 6 months ago
- Multiple ways to model user preference in recommender systems ☆18 · May 2, 2024 · Updated last year
- A public release of TimelineBuilder for building personal digital data timelines. ☆371 · Sep 3, 2024 · Updated last year
- A program synthesis agent that autonomously fixes its output by running tests! ☆469 · Sep 19, 2024 · Updated last year
- Application to take in video urls and stream either transcripts or tokens. ☆76 · Aug 28, 2023 · Updated 2 years ago
- various experiments for scaling inference time compute with small reasoning models ☆17 · Jan 16, 2025 · Updated last year
- Anthropic Claude2 Hackathon: Building MCTS with Claude for optimal action prediction during patient/doctor interactions. ☆104 · Sep 9, 2023 · Updated 2 years ago
- ☆15 · Apr 26, 2025 · Updated 11 months ago
- ☆15 · Sep 15, 2023 · Updated 2 years ago
- Demo of ConversationEntityMemory in Streamlit. ☆51 · Jan 23, 2023 · Updated 3 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,889 · May 17, 2025 · Updated 10 months ago
- 👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services. ☆364 · Jan 30, 2025 · Updated last year
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc) ☆20 · Sep 21, 2023 · Updated 2 years ago
- Live demo of shot-scraper ☆41 · Mar 2, 2025 · Updated last year