Lightning-AI / LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
☆2,996 · Updated this week
Alternatives and similar repositories for LitServe:
Users who are interested in LitServe are comparing it to the libraries listed below:
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,328 · Updated last month
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ☆2,818 · Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,882 · Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,802 · Updated 2 weeks ago
- ☆2,889 · Updated 6 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,530 · Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆3,961 · Updated last month
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,729 · Updated 7 months ago
- ☆1,567 · Updated last week
- PyTorch native post-training library ☆5,014 · Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications. ☆2,884 · Updated last week
- A system for agentic LLM-powered data processing and ETL ☆1,718 · Updated this week
- Build and query dynamic, temporally-aware Knowledge Graphs ☆2,478 · Updated this week
- Fast State-of-the-Art Static Embeddings ☆1,109 · Updated 3 weeks ago
- Knowledge Agents and Management in the Cloud ☆3,791 · Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ☆1,316 · Updated last week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,310 · Updated this week
- PyTorch native quantization and sparsity for training and inference ☆1,913 · Updated this week
- structured outputs for llms ☆9,860 · Updated this week
- Agent Framework / shim to use Pydantic with LLMs ☆7,308 · Updated this week
- The python library for real-time communication ☆3,148 · Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,473 · Updated last week
- A language model programming library. ☆5,689 · Updated 3 weeks ago
- Blazingly fast LLM inference. ☆5,240 · Updated this week
- RAG that intelligently adapts to your use case, data, and queries ☆3,042 · Updated 3 weeks ago
- Harness LLMs with Multi-Agent Programming ☆3,171 · Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,338 · Updated last month
- Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less. ☆5,922 · Updated this week
- Tools for merging pretrained large language models. ☆5,458 · Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open… ☆9,536 · Updated this week