athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆253Updated last week
Alternatives and similar repositories for athina-evals:
Users that are interested in athina-evals are comparing it to the libraries listed below
- Data-Driven Evaluation for LLM-Powered Applications☆463Updated last week
- Prompt engineering, automated.☆260Updated last month
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆349Updated 8 months ago
- 🦜💯 Flex those feathers!☆236Updated 2 months ago
- Tutorial for building LLM router☆170Updated 5 months ago
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, …☆395Updated last week
- Action library for AI Agent☆206Updated this week
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆233Updated 3 months ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆462Updated 8 months ago
- An Awesome list of curated DSPy resources.☆262Updated 4 months ago
- A tool for evaluating LLMs☆397Updated 8 months ago
- OpenTelemetry Instrumentation for AI Observability☆254Updated this week
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆680Updated this week
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆224Updated this week
- ☆259Updated 5 months ago
- An Intelligence Operating System☆310Updated this week
- Open-source RAG evaluation through users' feedback☆166Updated 9 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆204Updated this week
- A simple Python sandbox for helpful LLM data agents☆209Updated 6 months ago
- Synthetic Data for LLM Fine-Tuning☆107Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆146Updated 3 months ago
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆262Updated 9 months ago
- ☆197Updated last month
- FastAPI wrapper around DSPy☆229Updated 10 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆276Updated 2 months ago
- Structured information extraction from documents☆297Updated 3 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆79Updated 11 months ago
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.☆324Updated this week
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.☆540Updated this week
- Automated knowledge graph creation SDK☆119Updated last month