Lightning-AI / LitServeLinks
Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.
☆3,607Updated this week
Alternatives and similar repositories for LitServe
Users that are interested in LitServe are comparing it to the libraries listed below
Sorting:
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,840Updated 3 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,264Updated 2 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,526Updated 5 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,462Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,723Updated 5 months ago
- Deploy your agentic worfklows to production☆2,058Updated 2 months ago
- Fast State-of-the-Art Static Embeddings☆1,872Updated 2 weeks ago
- Knowledge Agents and Management in the Cloud☆4,190Updated last week
- A system for agentic LLM-powered data processing and ETL☆3,010Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,137Updated last week
- Composable building blocks to build Llama Apps☆8,123Updated last week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,753Updated last week
- ☆3,037Updated last year
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,617Updated last month
- ETL, Analytics, Versioning for Unstructured Data☆2,691Updated last week
- 🦾 Take control of your AI agents☆1,380Updated 2 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,527Updated this week
- 🦜⛏️ Did you say you like data?☆1,173Updated 2 weeks ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,508Updated 5 months ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,641Updated this week
- The easiest way to use Agentic RAG in any enterprise☆4,345Updated 9 months ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,861Updated 3 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,365Updated last year
- Rapidly build AI apps in Python☆6,467Updated 3 weeks ago
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide…☆1,396Updated 3 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,729Updated last week
- PyTorch native post-training library☆5,564Updated this week
- The python library for real-time communication☆4,373Updated last month
- ☆2,041Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,769Updated this week