Lightning-AI / LitServeLinks
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
☆3,298Updated this week
Alternatives and similar repositories for LitServe
Users that are interested in LitServe are comparing it to the libraries listed below
Sorting:
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,685Updated this week
- The python library for real-time communication☆4,037Updated last week
- A system for agentic LLM-powered data processing and ETL☆2,223Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,496Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,757Updated last week
- PyTorch native post-training library☆5,273Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,593Updated last month
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,328Updated 2 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,495Updated 2 weeks ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,148Updated this week
- Fast State-of-the-Art Static Embeddings☆1,732Updated 2 weeks ago
- Everything about the SmolLM2 and SmolVLM family of models☆2,574Updated 2 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,101Updated 4 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,262Updated 4 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,574Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,024Updated last month
- ☆1,816Updated 2 weeks ago
- The LLM Evaluation Framework☆8,370Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,449Updated 3 weeks ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆1,964Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,407Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,247Updated this week
- Knowledge Agents and Management in the Cloud☆4,014Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,429Updated last month
- ☆2,965Updated 9 months ago
- Deploy your agentic worfklows to production☆2,026Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆797Updated 4 months ago
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,365Updated this week
- Agent Framework / shim to use Pydantic with LLMs☆10,271Updated this week
- Tool for generating high quality Synthetic datasets☆948Updated last week