kolenaIO / autoarenaLinks
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆104Updated 5 months ago
Alternatives and similar repositories for autoarena
Users that are interested in autoarena are comparing it to the libraries listed below
Sorting:
- ☆122Updated 3 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆77Updated 3 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆145Updated last year
- ☆71Updated 8 months ago
- Dynamic Metadata based RAG Framework☆75Updated 10 months ago
- A Lightweight Library for AI Observability☆243Updated 3 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆130Updated last month
- ☆86Updated 3 weeks ago
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆80Updated 2 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆91Updated 7 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆70Updated 7 months ago
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with use…☆51Updated 9 months ago
- ☆89Updated last year
- A curated list of open source repositories for AI Engineers☆112Updated 2 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆148Updated 4 months ago
- Tutorial for building LLM router☆207Updated 10 months ago
- Routing on Random Forest (RoRF)☆161Updated 8 months ago
- Generate Tools and Toolkits from any Python SDK -- no extra code required☆50Updated 6 months ago
- RAG with postgreSQL(nebius) and pgvector☆24Updated 6 months ago
- A project that enables identification and classification of an intent of a message with dynamic labels☆39Updated 5 months ago
- An assistant for Slack built with Arcade and Langgraph. Interact with Google Calendar, Mail, Github, Search Engines, Firecrawl and more a…☆84Updated 2 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆37Updated last year
- ☆53Updated 7 months ago
- This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augm…☆32Updated 5 months ago
- Lyzr SDKs help you to build all your favorite GenAI SaaS products as enterprise applications in minutes.☆178Updated 5 months ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher☆162Updated 8 months ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆85Updated last year
- ☆61Updated 2 months ago
- ☆72Updated 7 months ago
- ☆34Updated last month