algorithmicsuperintelligence/optillm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/algorithmicsuperintelligence/optillm)

algorithmicsuperintelligence / optillm

Optimizing inference proxy for LLMs

☆4,177

Alternatives and similar repositories for optillm

Users that are interested in optillm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

algorithmicsuperintelligence / openevolve
View on GitHub
Open-source implementation of AlphaEvolve
☆6,745Updated this week
katanemo / plano
View on GitHub
Plano is an AI-native proxy server and data plane for agentic apps. Smart LLM routing, observability, agent orchestration, and guardrails…
☆6,871Updated this week
dottxt-ai / outlines
View on GitHub
Structured Outputs
☆14,547Updated this week
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,221Updated this week
xjdr-alt / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆3,435Nov 13, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
codelion / pts
View on GitHub
Pivotal Token Search
☆154Updated this week
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,243Jun 17, 2026Updated last month
OpenPipe / ART
View on GitHub
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…
☆10,492Updated this week
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,215Updated this week
neuml / txtai
View on GitHub
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,733Updated this week
BerriAI / litellm
View on GitHub
The fastest, litest AI Gateway. Rust core with Python SDK. Call 100+ LLM APIs in OpenAI (or native) format with cost tracking, guardrails…
☆54,020Updated this week
Aider-AI / aider
View on GitHub
aider is AI pair programming in your terminal
☆47,510May 22, 2026Updated last month
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,498Updated this week
confident-ai / deepeval
View on GitHub
The LLM Evaluation Framework
☆16,950Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
letta-ai / letta
View on GitHub
Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.
☆23,864Jul 3, 2026Updated 2 weeks ago
getzep / graphiti
View on GitHub
Build Real-Time Knowledge Graphs for AI Agents
☆28,912Updated this week
guidance-ai / guidance
View on GitHub
A guidance language for controlling large language models.
☆21,685May 21, 2026Updated last month
e-p-armstrong / augmentoolkit
View on GitHub
Create Custom LLMs
☆1,858Jun 27, 2026Updated 3 weeks ago
dphnAI / sonar
View on GitHub
Large-scale LLM inference engine
☆1,801Updated this week
OpenHands / OpenHands
View on GitHub
🙌 OpenHands: AI-Driven Development
☆81,293Updated this week
av / harbor
View on GitHub
Stop configuring your AI stack. Start using it. One command brings a complete pre-wired LLM stack with hundreds of services to explore.
☆3,143Updated this week
trotsky1997 / MathBlackBox
View on GitHub
☆1,033Dec 17, 2024Updated last year
LMCache / LMCache
View on GitHub
LMCache: Supercharge Your LLM with the Fastest KV Cache Layer
☆10,706Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
567-labs / instructor
View on GitHub
structured outputs for llms
☆13,568Jul 13, 2026Updated last week
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆86,634Updated this week
turboderp-org / exllamav2
View on GitHub
A fast inference library for running LLMs locally on modern consumer-class GPUs
☆4,586Mar 4, 2026Updated 4 months ago
build-with-groq / g1
View on GitHub
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
☆4,176Dec 30, 2025Updated 6 months ago
mem0ai / mem0
View on GitHub
Universal memory layer for AI Agents
☆61,213Updated this week
predibase / lorax
View on GitHub
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,816May 28, 2026Updated last month
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,332Updated this week
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,387Updated this week
SciPhi-AI / R2R
View on GitHub
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
☆7,931Nov 7, 2025Updated 8 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
codelion / adaptive-classifier
View on GitHub
A flexible, adaptive classification system for dynamic text classification
☆567Oct 7, 2025Updated 9 months ago
agno-agi / agno
View on GitHub
Build, run, and manage agent platforms.
☆41,287Updated this week
sentient-agi / ROMA
View on GitHub
Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.
☆5,093Feb 16, 2026Updated 5 months ago
docling-project / docling
View on GitHub
Get your documents ready for gen AI
☆63,456Updated this week
langroid / langroid
View on GitHub
Harness LLMs with Multi-Agent Programming
☆4,077Jul 12, 2026Updated last week
zhentingqi / rStar
View on GitHub
☆972Jan 23, 2025Updated last year
OpenAutoCoder / Agentless
View on GitHub
Agentless🐱: an agentless approach to automatically solve software development problems
☆2,083Dec 22, 2024Updated last year