athina-ai / athina-evalsLinks
Python SDK for running evaluations on LLM generated responses
☆295Updated 7 months ago
Alternatives and similar repositories for athina-evals
Users that are interested in athina-evals are comparing it to the libraries listed below
Sorting:
- Prompt engineering, automated.☆351Updated 8 months ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆264Updated 3 weeks ago
- A simple Python sandbox for helpful LLM data agents☆302Updated last year
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆246Updated last year
- A tool for evaluating LLMs☆428Updated last year
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆386Updated last year
- An Awesome list of curated DSPy resources.☆504Updated last month
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆264Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆155Updated last year
- Action library for AI Agent☆229Updated 9 months ago
- 🦜💯 Flex those feathers!☆255Updated last year
- Data-Driven Evaluation for LLM-Powered Applications☆515Updated 11 months ago
- Legacy project of an analytics platform for LLM-generated content☆439Updated 5 months ago
- The first AI Agent Server, Eidolon is a pluggable Agent SDK and enterprise ready, deployment server for Agentic applications☆480Updated 10 months ago
- FastAPI wrapper around DSPy☆290Updated last year
- Routing on Random Forest (RoRF)☆237Updated last year
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆408Updated 4 months ago
- Tutorial for building LLM router☆241Updated last year
- Open-source RAG evaluation through users' feedback☆213Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆82Updated 11 months ago
- ☆198Updated last year
- The open-source SDK for creating AI plugins and actions☆298Updated 8 months ago
- AGI SDK☆380Updated 3 weeks ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆326Updated last year
- The easiest, and fastest way to run AI-generated Python code safely☆354Updated last year
- Deep Research for your internal data☆352Updated 7 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆147Updated last year
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.☆722Updated this week
- Task-based Agentic Framework using StrictJSON as the core☆462Updated last month
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆114Updated 6 months ago