athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆221Updated this week
Related projects ⓘ
Alternatives and complementary repositories for athina-evals
- Prompt engineering, automated.☆246Updated 2 weeks ago
- Data-Driven Evaluation for LLM-Powered Applications☆447Updated 2 months ago
- 🦜💯 Flex those feathers!☆234Updated 3 weeks ago
- OpenTelemetry Instrumentation for AI Observability☆218Updated this week
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆341Updated 6 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆222Updated last month
- A tool for evaluating LLMs☆392Updated 6 months ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆444Updated 6 months ago
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, …☆377Updated this week
- An Awesome list of curated DSPy resources.☆226Updated 2 months ago
- Synthetic Data for LLM Fine-Tuning☆97Updated 11 months ago
- ☆182Updated 6 months ago
- Open-source RAG evaluation through users' feedback☆161Updated 7 months ago
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.☆248Updated this week
- Source available LLM Ops platform and LLM Optimization Studio powered by DSPy.☆341Updated this week
- A simple Python sandbox for helpful LLM data agents☆170Updated 5 months ago
- 🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.☆390Updated this week
- An AGentic Intelligence Operating System☆296Updated last week
- Tutorial for building LLM router☆163Updated 4 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆74Updated 2 months ago
- LLM fine-tuning and eval☆341Updated 8 months ago
- FastAPI wrapper around DSPy☆214Updated 8 months ago
- self-improving user memory framework for conversational AI apps☆145Updated this week
- The first AI Agent Server, Eidolon is a pluggable Agent SDK and enterprise ready, deployment server for Agentic applications☆286Updated this week
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆390Updated 11 months ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆219Updated this week
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆110Updated last month
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆262Updated 8 months ago
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev…☆583Updated this week
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud.☆289Updated 6 months ago