Lightning-AI / LitServeLinks

The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.

☆3,382

Alternatives and similar repositories for LitServe

Users that are interested in LitServe are comparing it to the libraries listed below

Sorting:

pytorch / torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,600Updated last week
NVIDIA / nv-ingest
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…
☆2,707Updated this week
qdrant / fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
☆2,207Updated last week
SylphAI-Inc / AdalFlow
AdalFlow: The library to build & auto-optimize LLM applications.
☆3,447Updated this week
MinishLab / model2vec
Fast State-of-the-Art Static Embeddings
☆1,756Updated this week
gradio-app / fastrtc
The python library for real-time communication
☆4,128Updated last week
mistralai / mistral-finetune
☆2,984Updated 10 months ago
AnswerDotAI / RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,569Updated 2 months ago
run-llama / llama_cloud_services
Knowledge Agents and Management in the Cloud
☆4,052Updated this week
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,274Updated last month
michaelfeil / infinity
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
☆2,306Updated 2 weeks ago
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,499Updated last month
MLSysOps / MLE-agent
🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide…
☆1,313Updated this week
merveenoyan / smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
☆1,520Updated last week
truefoundry / cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
☆4,147Updated 4 months ago
roboflow / maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
☆2,592Updated last week
meta-llama / llama-stack
Composable building blocks to build Llama Apps
☆7,907Updated this week
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,806Updated this week
ucbepic / docetl
A system for agentic LLM-powered data processing and ETL
☆2,354Updated last week
ragapp / ragapp
The easiest way to use Agentic RAG in any enterprise
☆4,284Updated 5 months ago
pytorch / torchtune
PyTorch native post-training library
☆5,323Updated last week
mistralai / cookbook
☆1,902Updated last week
huggingface / datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆2,473Updated this week
illuin-tech / colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,009Updated last week
iterative / datachain
ETL, Analytics, Versioning for Unstructured Data
☆2,606Updated this week
Lightning-AI / lightning-thunder
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…
☆1,375Updated last week
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆12,474Updated last week
argilla-io / argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆4,577Updated last week
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆2,803Updated last week
langwatch / langwatch
The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨
☆2,207Updated this week