codelion / optillm
Optimizing inference proxy for LLMs
☆2,201Updated last week
Alternatives and similar repositories for optillm:
Users who are interested in optillm are comparing it to the libraries listed below.
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,671Updated last week
- Synthetic data curation for post-training and structured data extraction☆1,290Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,491Updated last month
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,656Updated 4 months ago
- [ICLR 2025] Automated Design of Agentic Systems☆1,278Updated 3 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,884Updated 8 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,400Updated last month
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,964Updated last week
- A reading list on LLM based Synthetic Data Generation 🔥☆1,255Updated 2 months ago
- ☆1,017Updated 4 months ago
- ☆863Updated 7 months ago
- A library for advanced large language model reasoning☆2,113Updated 3 weeks ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,299Updated 3 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,488Updated 2 months ago
- A toolkit to create an optimal production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,408Updated 2 weeks ago
- Harness LLMs with Multi-Agent Programming☆3,265Updated this week
- Recipes to scale inference-time compute of open models☆1,066Updated 2 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,482Updated this week
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,078Updated 3 months ago
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"☆1,132Updated 2 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆2,971Updated last month
- System 2 Reasoning Link Collection☆828Updated last month
- High-performance retrieval engine for unstructured data☆1,364Updated this week
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆751Updated 9 months ago
- The Open Source Memory Layer For Autonomous Agents☆2,190Updated 6 months ago
- ☆1,356Updated 5 months ago
- Deploy your agentic workflows to production☆2,002Updated last week
- AIDE: AI-Driven Exploration in the Space of Code. State-of-the-art machine learning engineering agents that automate AI R&D.☆876Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddings☆1,563Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆930Updated last week