Lightning-AI / LitServe
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
☆3,746Updated last week
Alternatives and similar repositories for LitServe
Users who are interested in LitServe are comparing it to the libraries listed below.
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,308Updated last month
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,578Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,803Updated 7 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,945Updated last week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,789Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,156Updated last week
- The Python library for real-time communication☆4,464Updated last month
- Knowledge Agents and Management in the Cloud☆4,223Updated 2 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,570Updated 7 months ago
- Fast State-of-the-Art Static Embeddings☆1,959Updated last month
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,592Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,623Updated 3 months ago
- Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,649Updated last week
- A toolkit to create an optimal production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,522Updated 7 months ago
- Composable building blocks to build LLM Apps☆8,210Updated last week
- PyTorch native post-training library☆5,629Updated last week
- A system for agentic LLM-powered data processing and ETL☆3,310Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,584Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,818Updated 2 months ago
- ☆3,054Updated last month
- Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images☆2,717Updated last week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,489Updated last year
- Deploy your agentic workflows to production☆2,068Updated 2 weeks ago
- Everything about the SmolLM and SmolVLM family of models☆3,499Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,995Updated last week
- ☆2,123Updated last week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,798Updated last week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,875Updated last week
- 🦜⛏️ Did you say you like data?☆1,179Updated 2 months ago
- LLM abstractions that aren't obstructions☆1,333Updated this week