bentoml / llm-optimizerLinks
Benchmark and optimize LLM inference across frameworks with ease
☆161Updated 4 months ago
Alternatives and similar repositories for llm-optimizer
Users that are interested in llm-optimizer are comparing it to the libraries listed below
Sorting:
- ☆238Updated 2 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated last week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆460Updated 5 months ago
- Codebase for FinePDFs☆174Updated last month
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget.☆326Updated 2 weeks ago
- DSPydantic: Auto-Optimize Your Prompts and Pydantic Models with DSPy☆244Updated last week
- ☆43Updated 3 months ago
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆133Updated 11 months ago
- ☆80Updated 4 months ago
- ☆127Updated 4 months ago
- ☆170Updated 2 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance.☆274Updated 3 months ago
- How to build the best search, one step at a time!☆233Updated 2 months ago
- Fastest way to build and deploy reliable AI agents, MCP tools and agent-to-agent. Deploy in a production ready serverless environment.☆147Updated this week
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 4 months ago
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- RapidFire AI: Rapid AI Customization from RAG to Fine-Tuning☆138Updated this week
- ScalarLM - a unified training and inference stack☆97Updated 2 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- Train LLM on Hugging Face infra☆67Updated 2 months ago
- [ICLR2026] Test-Time Scaling with Reflective Generative Model☆302Updated last week
- ☆67Updated 8 months ago
- Building blocks for agents in C++☆139Updated 2 weeks ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- Train your own SOTA deductive reasoning model☆107Updated 11 months ago
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end refere…☆392Updated this week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 10 months ago
- ☆159Updated 9 months ago