kolenaIO / autoarena
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆105Updated 7 months ago
Alternatives and similar repositories for autoarena
Users that are interested in autoarena are comparing it to the libraries listed below
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆146Updated last year
- Routing on Random Forest (RoRF)☆181Updated 10 months ago
- Dynamic Metadata based RAG Framework☆75Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆76Updated 9 months ago
- Tutorial for building LLM router☆220Updated last year
- ☆74Updated 10 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆154Updated this week
- Research assistant for performing online research on a given topic, using LlamaIndex Workflows and Tavily API. Inspired by GPT-Researcher☆163Updated 10 months ago
- ☆123Updated last year
- ☆122Updated 5 months ago
- Open-source RAG evaluation through users' feedback☆194Updated last year
- A Python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆112Updated 3 weeks ago
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆129Updated last month
- A project that enables identification and classification of an intent of a message with dynamic labels☆43Updated 7 months ago
- A list of AI memory projects☆179Updated 6 months ago
- Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.☆234Updated 8 months ago
- Testing and evaluation framework for voice agents☆129Updated last month
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆78Updated 5 months ago
- A toolkit for building computer use AI agents☆170Updated last month
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆133Updated last month
- Low-code multi-agent automation framework☆255Updated last year
- Lyzr SDKs help you to build all your favorite GenAI SaaS products as enterprise applications in minutes.☆181Updated 7 months ago
- A memory framework for Large Language Models and Agents.☆183Updated 7 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆146Updated last year
- A Lightweight Library for AI Observability☆249Updated 5 months ago
- ☆87Updated 2 months ago
- ☆53Updated 9 months ago
- A fork of OpenAI Swarm that supports Groq and Anthropic☆121Updated 5 months ago
- RAG example using DSPy, Gradio, FastAPI☆83Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆264Updated last year