kolenaIO / autoarena
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆102Updated 2 months ago
Alternatives and similar repositories for autoarena:
Users that are interested in autoarena are comparing it to the libraries listed below
- Testing and evaluation framework for voice agents☆97Updated 3 weeks ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆142Updated 11 months ago
- A Lightweight Library for AI Observability☆236Updated 3 weeks ago
- ☆77Updated 5 months ago
- Dynamic Metadata based RAG Framework☆72Updated 7 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆108Updated last week
- Multi-Agents using Workflows☆46Updated 2 months ago
- A toolkit for building computer use AI agents☆148Updated this week
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆89Updated 5 months ago
- Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.☆231Updated 4 months ago
- ☆67Updated 5 months ago
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with use…☆48Updated 6 months ago
- ☆118Updated last week
- Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with B…☆42Updated 5 months ago
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆107Updated 7 months ago
- Routing on Random Forest (RoRF)☆130Updated 5 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆80Updated last year
- simplifies the process of creating and managing LLM workflows.☆97Updated 4 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆145Updated last month
- ☆53Updated 4 months ago
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆76Updated last month
- Lyzr SDKs help you to build all your favorite GenAI SaaS products as enterprise applications in minutes.☆175Updated 3 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆122Updated 4 months ago
- ☆71Updated 4 months ago
- Tutorial for building LLM router☆186Updated 7 months ago
- Declarative framework to build LLM-based applications☆115Updated 4 months ago
- This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clini…☆190Updated 5 months ago