Lightning-AI / LitServe
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
☆3,346Updated this week
Alternatives and similar repositories for LitServe
Users who are interested in LitServe are comparing it to the libraries listed below.
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,165Updated last month
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,698Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,175Updated last week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,131Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,550Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,596Updated last month
- Fast State-of-the-Art Static Embeddings☆1,746Updated last month
- Knowledge Agents and Management in the Cloud☆4,035Updated this week
- ☆2,977Updated 9 months ago
- PyTorch native post-training library☆5,296Updated this week
- A system for agentic LLM-powered data processing and ETL☆2,326Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,375Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,271Updated last week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,563Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,481Updated last month
- Composable building blocks to build Llama Apps☆7,886Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,788Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,434Updated last week
- Blazingly fast LLM inference.☆5,802Updated this week
- ☆1,849Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,060Updated last week
- The easiest way to use Agentic RAG in any enterprise☆4,277Updated 5 months ago
- ETL, Analytics, Versioning for Unstructured Data☆2,593Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,020Updated this week
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,437Updated last month
- Everything about the SmolLM2 and SmolVLM family of models☆2,606Updated last week
- Supercharge Your LLM Application Evaluations 🚀☆9,799Updated this week
- Deploy your agentic workflows to production☆2,028Updated last week
- Tools for merging pretrained large language models.☆5,937Updated 2 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,545Updated 4 months ago