Lightning-AI / LitServeLinks
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
☆3,361Updated this week
Alternatives and similar repositories for LitServe
Users that are interested in LitServe are comparing it to the libraries listed below
Sorting:
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,131Updated 4 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,263Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,550Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,060Updated 2 weeks ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,375Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,192Updated last week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,698Updated this week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,587Updated last week
- The python library for real-time communication☆4,096Updated 3 weeks ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,596Updated last month
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,002Updated this week
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,372Updated this week
- Structured Outputs☆11,990Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,507Updated this week
- The easiest way to use Agentic RAG in any enterprise☆4,277Updated 5 months ago
- Fast State-of-the-Art Static Embeddings☆1,746Updated last month
- Deploy your agentic worfklows to production☆2,031Updated this week
- A blazing fast inference solution for text embeddings models☆3,758Updated this week
- structured outputs for llms☆10,876Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,796Updated last week
- A language model programming library.☆5,789Updated last month
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,231Updated 3 months ago
- Knowledge Agents and Management in the Cloud☆4,035Updated last week
- PyTorch native post-training library☆5,306Updated this week
- ☆2,977Updated 9 months ago
- Curated list of datasets and tools for post-training.☆3,226Updated 5 months ago
- Everything about the SmolLM2 and SmolVLM family of models☆2,623Updated last week
- Open-source AI cookbook☆2,138Updated 2 weeks ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,195Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,298Updated this week