A Pythonic framework to simplify AI service building
☆2,808Jan 31, 2026Updated last month
Alternatives and similar repositories for leptonai
Users that are interested in leptonai are comparing it to the libraries listed below
Sorting:
- Building a quick conversation-based search demo with Lepton AI.☆8,110Dec 2, 2025Updated 3 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,844Nov 27, 2024Updated last year
- SGLang is a high-performance serving framework for large language models and multimodal models.☆24,455Updated this week
- Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.☆11,062Updated this week
- LlamaIndex is the leading document agent and OCR platform☆47,608Updated this week
- Universal memory layer for AI Agents☆49,365Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,807Updated this week
- High-speed Large Language Model Serving for Local Deployment☆8,797Jan 24, 2026Updated last month
- Question and Answer based on Anything.☆13,881Mar 24, 2025Updated 11 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆72,827Updated this week
- Perplexity Inspired Answer Engine☆5,021Jun 27, 2025Updated 8 months ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,925Nov 25, 2024Updated last year
- Large World Model -- Modeling Text and Video with Millions Context☆7,399Oct 19, 2024Updated last year
- MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks☆8,707Feb 11, 2026Updated last month
- Making large AI models cheaper, faster and more accessible☆41,360Mar 9, 2026Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,161Mar 9, 2026Updated last week
- Modeling, training, eval, and inference code for OLMo☆6,388Nov 24, 2025Updated 3 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,670Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,418Jun 2, 2025Updated 9 months ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆38,879Updated this week
- Universal LLM Deployment Engine with ML Compilation☆22,194Mar 9, 2026Updated last week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,579Updated this week
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,723Nov 18, 2024Updated last year
- Large Language Model Text Generation Inference☆10,803Jan 8, 2026Updated 2 months ago
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,165Oct 30, 2025Updated 4 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,201Jul 11, 2024Updated last year
- Inference code for CodeLlama models☆16,350Aug 12, 2024Updated last year
- A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.☆10,899Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,956Jul 11, 2025Updated 8 months ago
- AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents☆18,241Updated this week
- The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.☆6,128Feb 3, 2026Updated last month
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆13,057Updated this week
- Build, run, manage agentic software at scale.☆38,700Updated this week
- ☆6,752Jun 26, 2025Updated 8 months ago
- Build ChatGPT over your data, all with natural language☆6,534Apr 5, 2024Updated last year
- A programming framework for agentic AI☆55,559Updated this week
- An Autonomous LLM Agent for Complex Task Solving☆8,509Aug 12, 2024Updated last year
- Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a cro…☆39,703Updated this week
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,883Sep 26, 2024Updated last year