Lightning-AI / LitServeLinks
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
☆3,382Updated last week
Alternatives and similar repositories for LitServe
Users that are interested in LitServe are comparing it to the libraries listed below
Sorting:
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,600Updated last week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,707Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,207Updated last week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,447Updated this week
- Fast State-of-the-Art Static Embeddings☆1,756Updated this week
- The python library for real-time communication☆4,128Updated last week
- ☆2,984Updated 10 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,569Updated 2 months ago
- Knowledge Agents and Management in the Cloud☆4,052Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,274Updated last month
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,306Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,499Updated last month
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide…☆1,313Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,520Updated last week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,147Updated 4 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,592Updated last week
- Composable building blocks to build Llama Apps☆7,907Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,806Updated this week
- A system for agentic LLM-powered data processing and ETL☆2,354Updated last week
- The easiest way to use Agentic RAG in any enterprise☆4,284Updated 5 months ago
- PyTorch native post-training library☆5,323Updated last week
- ☆1,902Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,473Updated this week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,009Updated last week
- ETL, Analytics, Versioning for Unstructured Data☆2,606Updated this week
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,375Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,474Updated last week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,577Updated last week
- Everything about the SmolLM and SmolVLM family of models☆2,803Updated last week
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,207Updated this week