Lightning-AI / LitServe
Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more.
☆3,041Updated this week
Alternatives and similar repositories for LitServe:
Users that are interested in LitServe are comparing it to the libraries listed below
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆1,941Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications.☆2,916Updated 3 weeks ago
- Knowledge Agents and Management in the Cloud☆3,875Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,942Updated last month
- The python library for real-time communication☆3,515Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,386Updated 2 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,189Updated 2 months ago
- PyTorch native post-training library☆5,084Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,999Updated last month
- ☆2,912Updated 7 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,407Updated 2 months ago
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,642Updated this week
- Deploy your agentic worfklows to production☆1,995Updated 3 weeks ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,016Updated 3 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,802Updated 8 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,027Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,375Updated 2 weeks ago
- A powerful framework for building realtime voice AI agents 🤖🎙️📹☆5,544Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,565Updated this week
- Vision agent☆4,492Updated this week
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,323Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,632Updated 2 weeks ago
- A blazing fast inference solution for text embeddings models☆3,414Updated last week
- ETL, Analytics, Versioning for Unstructured Data☆2,497Updated this week
- Everything about the SmolLM2 and SmolVLM family of models☆2,177Updated 2 weeks ago
- Build Real-Time Knowledge Graphs for AI Agents☆3,695Updated this week
- Supercharge Your LLM Application Evaluations 🚀☆8,800Updated last week
- AI Observability & Evaluation☆5,352Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,629Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,948Updated this week