codelion / optillm
Optimizing inference proxy for LLMs
☆2,110Updated this week
Alternatives and similar repositories for optillm:
Users who are interested in optillm are comparing it to the libraries listed below.
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,568Updated this week
- Synthetic data curation for post-training and structured data extraction☆1,049Updated this week
- Recipes to scale inference-time compute of open models☆1,041Updated 3 weeks ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,572Updated 3 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,742Updated 3 weeks ago
- ☆838Updated 6 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,338Updated last month
- [ICLR 2025] Automated Design of Agentic Systems☆1,225Updated last month
- ☆1,011Updated 3 months ago
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆986Updated last month
- [ICLR 2025] Agent S: an open agentic framework that uses computers like a human☆1,321Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,729Updated 7 months ago
- AIDE: AI-Driven Exploration in the Space of Code. State-of-the-art machine learning engineering agents that automate AI R&D.☆803Updated 3 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,802Updated 2 weeks ago
- RAG that intelligently adapts to your use case, data, and queries☆3,042Updated 3 weeks ago
- System 2 Reasoning Link Collection☆811Updated last week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,271Updated this week
- Fast State-of-the-Art Static Embeddings☆1,109Updated 3 weeks ago
- Deploy your agentic workflows to production☆1,981Updated 2 weeks ago
- High-performance retrieval engine for unstructured data☆1,272Updated this week
- Everything about the SmolLM2 and SmolVLM family of models☆2,035Updated last week
- The Open Source Memory Layer For Autonomous Agents☆2,041Updated 5 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,084Updated 2 months ago
- This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated …☆1,096Updated 2 weeks ago
- Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.☆1,367Updated last month
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,045Updated this week
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆871Updated 10 months ago