Lightning-AI / LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
☆2,489 Updated this week
Related projects
Alternatives and complementary repositories for LitServe
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆3,322 Updated this week
- ☆2,746 Updated 2 months ago
- The easiest way to use Agentic RAG in any enterprise ☆3,866 Updated this week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents ☆2,487 Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,383 Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! ☆3,256 Updated 3 months ago
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide… ☆1,096 Updated this week
- Composable building blocks to build Llama Apps ☆4,594 Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ☆3,677 Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆2,830 Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,057 Updated 2 months ago
- Build real-time multimodal AI applications 🤖🎙️📹 ☆4,010 Updated this week
- PyTorch native quantization and sparsity for training and inference ☆1,585 Updated this week
- Parse files for optimal RAG ☆3,173 Updated last week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ☆1,415 Updated this week
- A native PyTorch Library for large model training ☆2,623 Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,526 Updated this week
- PyTorch native finetuning library ☆4,336 Updated this week
- A language model programming library. ☆5,295 Updated this week
- ☆1,271 Updated 2 weeks ago
- AdalFlow: The library to build & auto-optimize LLM applications. ☆2,074 Updated this week
- Efficient Triton Kernels for LLM Training ☆3,454 Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,227 Updated last week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,199 Updated this week
- Blazingly fast LLM inference. ☆4,472 Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜 ☆890 Updated 2 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆3,540 Updated 2 weeks ago
- A system for agentic LLM-powered data processing and ETL ☆1,269 Updated this week
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆2,602 Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆1,473 Updated this week