Lightning-AI / LitServeLinks
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
☆3,513Updated last week
Alternatives and similar repositories for LitServe
Users that are interested in LitServe are comparing it to the libraries listed below
Sorting:
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,394Updated 3 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,733Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,317Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,607Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddings☆1,807Updated 2 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,193Updated 6 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,592Updated last week
- A system for agentic LLM-powered data processing and ETL☆2,722Updated this week
- ETL, Analytics, Versioning for Unstructured Data☆2,623Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,390Updated last week
- Knowledge Agents and Management in the Cloud☆4,121Updated this week
- A blazing fast inference solution for text embeddings models☆3,942Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,523Updated 3 months ago
- ☆3,009Updated 11 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,647Updated 3 months ago
- Deploy your agentic worfklows to production☆2,053Updated 2 weeks ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,465Updated 3 months ago
- ☆1,957Updated last week
- The python library for real-time communication☆4,249Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,572Updated 2 weeks ago
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,393Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,863Updated this week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,630Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,270Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,388Updated this week
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,205Updated last month
- The easiest way to use Agentic RAG in any enterprise☆4,308Updated 7 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆851Updated last month
- Open-source AI cookbook☆2,212Updated 3 weeks ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,271Updated last week