kolenaIO / autoarena
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆99Updated last month
Related projects ⓘ
Alternatives and complementary repositories for autoarena
- A toolkit for building multimodal AI agents☆111Updated this week
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆137Updated 7 months ago
- ☆105Updated last month
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with use…☆43Updated 3 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆74Updated 2 months ago
- ☆56Updated last month
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆50Updated last month
- Testing and evaluation framework for voice agents☆48Updated this week
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆67Updated this week
- AI agent with RAG+ReAct on Indian Constitution & BNS☆54Updated 3 weeks ago
- ☆50Updated last month
- A fork of OpenAI Swarm that supports Groq and Anthropic☆85Updated last month
- A memory framework for Large Language Models and Agents.☆162Updated 3 months ago
- Research assistant for performing online research on a given topic, using Llamaindex Workflows and Tavily API. Inspired by GPT-Researcher☆149Updated last month
- Chat with PDF files with source highlights☆67Updated last week
- ☆46Updated last month
- Lyzr SDKs help you to build all your favorite GenAI SaaS products as enterprise applications in minutes.☆164Updated 4 months ago
- 🤖 Headless IDE for AI agents☆132Updated this week
- Automated knowledge graph creation SDK☆112Updated 4 months ago
- Routing on Random Forest (RoRF)☆84Updated last month
- RepoGPT: AI-powered GitHub assistant to chat, manage, and explore your repos effortlessly.☆188Updated last month
- ☆53Updated 3 weeks ago
- This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clini…☆172Updated last month
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆110Updated last month
- Dynamic Metadata based RAG Framework☆71Updated 3 months ago
- ☆87Updated 10 months ago
- Declarative framework to build LLM-based applications☆99Updated last week
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆102Updated 3 months ago
- ☆114Updated 5 months ago
- The long-term memory for your Superagents 🥷and LLMs 🤖. Built with GraphRAG, Knowledge graphs and autonomous ai agents☆44Updated last month