Lightning-AI / LitServeLinks
Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.
☆3,728Updated last week
Alternatives and similar repositories for LitServe
Users that are interested in LitServe are comparing it to the libraries listed below
Sorting:
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,905Updated last week
- Fast State-of-the-Art Static Embeddings☆1,939Updated 3 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,293Updated 2 weeks ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,542Updated last week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,774Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,620Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,789Updated 6 months ago
- A system for agentic LLM-powered data processing and ETL☆3,204Updated last week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL☆2,642Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,566Updated 6 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,517Updated 6 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,799Updated last month
- ☆2,088Updated 2 weeks ago
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,429Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,119Updated this week
- ☆3,040Updated 2 weeks ago
- The easiest way to use Agentic RAG in any enterprise☆4,369Updated 10 months ago
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide…☆1,470Updated 4 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,581Updated 6 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,374Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,575Updated 2 weeks ago
- Deploy your agentic worfklows to production☆2,064Updated last week
- Knowledge Agents and Management in the Cloud☆4,218Updated last week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,455Updated last year
- Open-source AI cookbook☆2,525Updated last month
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,672Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,388Updated 7 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,153Updated this week
- LLM abstractions that aren't obstructions☆1,318Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,510Updated last month