anyscale / llm-router
Tutorial for building an LLM router
☆206 Updated 10 months ago
Alternatives and similar repositories for llm-router
Users who are interested in llm-router are comparing it to the libraries listed below.
- A Lightweight Library for AI Observability ☆243 Updated 3 months ago
- A simple Python sandbox for helpful LLM data agents ☆262 Updated 11 months ago
- Synthetic Data for LLM Fine-Tuning ☆116 Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆221 Updated last year
- Routing on Random Forest (RoRF) ☆161 Updated 8 months ago
- Together Open Deep Research ☆298 Updated last month
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆289 Updated 2 weeks ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ☆148 Updated 4 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆344 Updated 11 months ago
- Task-based Agentic Framework using StrictJSON as the core ☆450 Updated last month
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated 10 months ago
- ☆194 Updated last year
- Simple UI for debugging correlations of text embeddings ☆180 Updated this week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 Updated last year
- An Awesome list of curated DSPy resources. ☆326 Updated 3 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆130 Updated last month
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM. ☆305 Updated 2 months ago
- FastAPI wrapper around DSPy ☆242 Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆169 Updated 8 months ago
- Fast parallel LLM inference for MLX ☆189 Updated 10 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆130 Updated 3 weeks ago
- Testing and evaluation framework for voice agents ☆119 Updated 3 weeks ago
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon) ☆263 Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆235 Updated 7 months ago
- 🤖 Headless IDE for AI agents ☆188 Updated last month
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆443 Updated 8 months ago
- Efficient vector database for hundreds of millions of embeddings. ☆205 Updated last year
- A curated list of awesome approaches to AI model routing ☆116 Updated 2 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d… ☆145 Updated last year