kolenaIO / autoarena
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆98 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for autoarena
- Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with B… ☆34 · Updated 3 weeks ago
- A toolkit for building multimodal AI agents ☆108 · Updated 2 weeks ago
- ☆105 · Updated last month
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d… ☆137 · Updated 7 months ago
- Declarative framework to build LLM-based applications ☆87 · Updated this week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 · Updated 7 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆74 · Updated 2 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆79 · Updated 9 months ago
- Connect Data Silos with Reliable AI⚡🚀 ☆158 · Updated this week
- A fork of OpenAI Swarm that supports Groq and Anthropic ☆82 · Updated 3 weeks ago
- Dynamiq is an orchestration framework for agentic AI and LLM applications ☆232 · Updated this week
- ☆55 · Updated last month
- Dynamic Metadata based RAG Framework ☆71 · Updated 3 months ago
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama. ☆110 · Updated 3 weeks ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop ☆44 · Updated last month
- Lyzr SDKs help you to build all your favorite GenAI SaaS products as enterprise applications in minutes. ☆161 · Updated 3 months ago
- This project involves using LlamaIndex Multi Agents concierge system and Qdrant vector database to customize the RAG application with use… ☆43 · Updated 2 months ago
- A reimplementation of LangGraph's customer support example in Rasa's CALM paradigm and a quantitative evaluation of the two approaches ☆67 · Updated last week
- Data extraction with LLM on CPU ☆109 · Updated 10 months ago
- ☆61 · Updated 3 weeks ago
- This software contains an agent based on LangGraph & LangChain for solving general requests in the WhatsApp channel of this medical clini… ☆171 · Updated last month
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation ☆93 · Updated this week
- Generate Tools and Toolkits from any Python SDK -- no extra code required ☆50 · Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆168 · Updated this week
- Embed anything. ☆29 · Updated 5 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework. ☆77 · Updated last month
- Solving data for LLMs - Create quality synthetic datasets! ☆136 · Updated 3 weeks ago
- Open-source RAG evaluation through users' feedback ☆160 · Updated 6 months ago
- Welcome to the Natural Language to SQL demo project using LlamaIndex! This application is designed to demonstrate the innovative use of L… ☆67 · Updated 7 months ago
- 🤖 Headless IDE for AI agents ☆129 · Updated this week