A collection of LLM services you can self host via docker or modal labs to support your applications development
☆201Apr 29, 2024Updated last year
Alternatives and similar repositories for fastllm
Users that are interested in fastllm are comparing it to the libraries listed below
Sorting:
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 11 months ago
- Utility to use eleven lab's streaming to in the command line☆11Aug 8, 2023Updated 2 years ago
- automatic sentence highlights based on their significance to the document☆197Nov 22, 2023Updated 2 years ago
- A strongly typed Python DSL for developing message passing multi agent systems☆53Apr 9, 2024Updated last year
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- ☆21Oct 14, 2024Updated last year
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- A collection of tools for your LLMs that run on Modal☆23Feb 28, 2025Updated last year
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- structured outputs for llms☆12,468Feb 25, 2026Updated last week
- ☆16Mar 23, 2023Updated 2 years ago
- an ambient intelligence library☆6,089Feb 27, 2026Updated last week
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.☆104Sep 9, 2023Updated 2 years ago
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Sep 21, 2023Updated 2 years ago
- run embeddings in MLX☆97Sep 27, 2024Updated last year
- A library to use `modal` as a backend for `joblib`.☆32Jan 15, 2025Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Sep 29, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 5 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,868May 17, 2025Updated 9 months ago
- Demo of ConversationEntityMemory in Streamlit.☆52Jan 23, 2023Updated 3 years ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37May 14, 2024Updated last year
- A public release of TimelineBuilder for building personal digital data timelines.☆371Sep 3, 2024Updated last year
- ☆220Jan 18, 2026Updated last month
- ☆19May 6, 2023Updated 2 years ago
- ☆66Aug 5, 2025Updated 7 months ago
- Synchronicity lets you interoperate with asynchronous Python APIs.☆133Dec 27, 2025Updated 2 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆27Mar 6, 2025Updated last year
- A curated list of resources related to structured generation 🔥☆23Jul 25, 2025Updated 7 months ago
- A framework for evaluating function calls made by LLMs☆40Jul 23, 2024Updated last year
- Vanilla-Python ergonomics on top of DSPy☆40Jun 3, 2025Updated 9 months ago
- Code Interpreter Replica☆26Jul 14, 2023Updated 2 years ago
- A GPT agent framework for invoking APIs☆736Jun 23, 2023Updated 2 years ago
- ☆184Feb 2, 2026Updated last month
- Chat language model that can use tools and interpret the results☆1,591Dec 3, 2025Updated 3 months ago
- ☆473Dec 27, 2023Updated 2 years ago
- Sparse autoencoders for Contra text embedding models☆25Apr 24, 2024Updated last year
- Memory library for building stateful agents☆383Feb 27, 2026Updated last week
- ⛓️ build cognitive systems, pythonic☆339Nov 19, 2024Updated last year