codelion / optillmLinks

Optimizing inference proxy for LLMs

☆2,695

Alternatives and similar repositories for optillm

Users that are interested in optillm are comparing it to the libraries listed below

Sorting:

bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,464Updated 3 weeks ago
OpenAutoCoder / Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
☆1,831Updated 7 months ago
NVIDIA / RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
☆1,205Updated last week
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,821Updated this week
dCaples / AutoDidact
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆648Updated 4 months ago
e-p-armstrong / augmentoolkit
Create Custom LLMs
☆1,690Updated last week
ShengranHu / ADAS
[ICLR 2025] Automated Design of Agentic Systems
☆1,392Updated 6 months ago
lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,132Updated 11 months ago
willccbb / verifiers
Verifiers for LLM Reinforcement Learning
☆1,621Updated this week
NousResearch / Hermes-Function-Calling
☆1,022Updated 10 months ago
zou-group / textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
☆2,796Updated last week
WecoAI / aideml
AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.
☆972Updated this week
HazyResearch / minions
Big & Small LLMs working together
☆1,088Updated this week
noamgat / lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
☆1,858Updated 5 months ago
aphrodite-engine / aphrodite-engine
Large-scale LLM inference engine
☆1,482Updated last week
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,333Updated 2 months ago
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,131Updated 6 months ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,110Updated 2 months ago
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,505Updated 2 months ago
trotsky1997 / MathBlackBox
☆1,028Updated 7 months ago
langroid / langroid
Harness LLMs with Multi-Agent Programming
☆3,545Updated last week
huggingface / evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…
☆1,498Updated 6 months ago
open-thought / system-2-research
System 2 Reasoning Link Collection
☆848Updated 4 months ago
illuin-tech / colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,088Updated this week
octotools / octotools
OctoTools: An agentic framework with extensible tools for complex reasoning
☆1,319Updated last week
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,032Updated this week
caspianmoon / memoripy
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
☆651Updated 6 months ago
D-Star-AI / dsRAG
High-performance retrieval engine for unstructured data
☆1,459Updated this week
meta-llama / synthetic-data-kit
Tool for generating high quality Synthetic datasets
☆1,081Updated last week
EpistasisLab / KRAGEN
Software to implement GoT with a weviate vectorized database
☆674Updated 4 months ago