bentoml / llm-optimizer
Benchmark and optimize LLM inference across frameworks with ease
☆161 Updated 4 months ago
Alternatives and similar repositories for llm-optimizer
Users who are interested in llm-optimizer are comparing it to the libraries listed below.
- ☆238 Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆261 Updated this week
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆169 Updated 5 months ago
- Codebase for FinePDFs ☆176 Updated last month
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆460 Updated 5 months ago
- A framework for fine-tuning retrieval-augmented generation (RAG) systems. ☆139 Updated 3 weeks ago
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end refere… ☆392 Updated last week
- ☆80 Updated 4 months ago
- DSPydantic: Auto-Optimize Your Prompts and Pydantic Models with DSPy ☆244 Updated last week
- A Text-Based Environment for Interactive Debugging ☆294 Updated this week
- A Lightweight Library for AI Observability ☆255 Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆115 Updated 10 months ago
- ScalarLM - a unified training and inference stack ☆97 Updated 2 months ago
- A multi-agent LLM system for detecting and resolving cognitive dissonance. ☆274 Updated 3 months ago
- RapidFire AI: Rapid AI Customization from RAG to Fine-Tuning ☆138 Updated this week
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang ☆100 Updated this week
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget. ☆326 Updated 2 weeks ago
- ☆170 Updated 2 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models ☆117 Updated last month
- Building blocks for agents in C++ ☆139 Updated 2 weeks ago
- Self-host LLMs with vLLM and BentoML ☆168 Updated 2 weeks ago
- Routing on Random Forest (RoRF) ☆239 Updated last year
- ☆43 Updated 3 months ago
- Pivotal Token Search ☆144 Updated last month
- Simple UI for debugging correlations of text embeddings ☆305 Updated 8 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆112 Updated 9 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation ☆108 Updated last year
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-… ☆133 Updated 11 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆84 Updated last year