kolenaIO / autoarena
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆108Updated last year
Alternatives and similar repositories for autoarena
Users who are interested in autoarena are comparing it to the libraries listed below
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆83Updated last year
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆147Updated last year
- Testing and evaluation framework for voice agents☆162Updated 7 months ago
- Dynamic Metadata based RAG Framework☆78Updated last month
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- ☆125Updated 11 months ago
- Tutorial for building LLM router☆242Updated last year
- Routing on Random Forest (RoRF)☆239Updated last year
- ☆74Updated last year
- A Python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆115Updated 6 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆141Updated 5 months ago
- Agentic RAG to help you build a startup🚀☆55Updated 9 months ago
- Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.☆235Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆264Updated last year
- A memory framework for Large Language Models and Agents.☆181Updated last year
- A project that enables identification and classification of an intent of a message with dynamic labels☆50Updated last year
- Open-source RAG evaluation through users' feedback☆215Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆87Updated last year
- Low-code multi-agent automation framework☆264Updated 3 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆78Updated 9 months ago
- ☆125Updated last year
- Simplifies the process of creating and managing LLM workflows.☆113Updated last year
- Graphite Agentic Framework by Binome Technologies☆172Updated 3 weeks ago
- Deep Research for your internal data☆353Updated 7 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆84Updated last year
- Diffbot LLM Inference Server☆227Updated 5 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆308Updated last week
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆148Updated last year