athina-ai / athina-evalsLinks
Python SDK for running evaluations on LLM generated responses
☆292Updated 3 months ago
Alternatives and similar repositories for athina-evals
Users that are interested in athina-evals are comparing it to the libraries listed below
Sorting:
- Prompt engineering, automated.☆343Updated 5 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆248Updated 11 months ago
- An Awesome list of curated DSPy resources.☆439Updated last month
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆260Updated last week
- A tool for evaluating LLMs☆424Updated last year
- Data-Driven Evaluation for LLM-Powered Applications☆506Updated 8 months ago
- Legacy project☆438Updated 2 months ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆387Updated last year
- A simple Python sandbox for helpful LLM data agents☆285Updated last year
- Action library for AI Agent☆224Updated 6 months ago
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆264Updated last year
- 🦜💯 Flex those feathers!☆252Updated 11 months ago
- FastAPI wrapper around DSPy☆271Updated last year
- Task-based Agentic Framework using StrictJSON as the core☆459Updated last month
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆285Updated 2 months ago
- Testing and evaluation framework for voice agents☆151Updated 3 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 11 months ago
- Open-source RAG evaluation through users' feedback☆204Updated last year
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆318Updated 2 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆79Updated 7 months ago
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.☆671Updated 3 weeks ago
- Deep Research for your internal data☆338Updated 3 months ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆508Updated last year
- ☆73Updated 11 months ago
- 🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.☆562Updated this week
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a…☆65Updated 2 weeks ago
- Structured information extraction from documents☆317Updated last year
- Tutorial for building LLM router☆228Updated last year
- OpenTelemetry Instrumentation for AI Observability☆608Updated last week
- Routing on Random Forest (RoRF)☆211Updated last year