athina-ai / athina-evalsLinks
Python SDK for running evaluations on LLM generated responses
☆280Updated 2 weeks ago
Alternatives and similar repositories for athina-evals
Users that are interested in athina-evals are comparing it to the libraries listed below
Sorting:
- Prompt engineering, automated.☆321Updated last month
- A tool for evaluating LLMs☆419Updated last year
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆242Updated last week
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆239Updated 7 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆77Updated 3 months ago
- Action library for AI Agent☆214Updated 2 months ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆374Updated last year
- Data-Driven Evaluation for LLM-Powered Applications☆493Updated 4 months ago
- A curated list of open source repositories for AI Engineers☆112Updated 2 months ago
- Tutorial for building LLM router☆206Updated 10 months ago
- A simple Python sandbox for helpful LLM data agents☆262Updated 11 months ago
- OpenTelemetry Instrumentation for AI Observability☆442Updated this week
- Synthetic Data for LLM Fine-Tuning☆116Updated last year
- An Intelligence Operating System☆324Updated this week
- An example of multi-agent orchestration with llama-index☆424Updated 4 months ago
- ☆194Updated last year
- FastAPI wrapper around DSPy☆242Updated last year
- Fine-tuning and serving LLMs on any cloud☆89Updated last year
- ☆164Updated last year
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆130Updated 7 months ago
- Task-based Agentic Framework using StrictJSON as the core☆450Updated last month
- A Lightweight Library for AI Observability☆243Updated 3 months ago
- ☆355Updated 2 weeks ago
- Routing on Random Forest (RoRF)☆161Updated 8 months ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆486Updated last year
- ☆72Updated 7 months ago
- Open-source AI copilot that lets you chat with your observability data and code 🧙♂️☆348Updated last month
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆267Updated 2 months ago
- Testing and evaluation framework for voice agents☆119Updated 3 weeks ago
- Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.☆237Updated this week