athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆292 Updated 4 months ago
Alternatives and similar repositories for athina-evals
Users who are interested in athina-evals are comparing it to the libraries listed below.
- Prompt engineering, automated. ☆346 Updated 6 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆247 Updated last year
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba… ☆263 Updated last week
- An Awesome list of curated DSPy resources. ☆458 Updated 2 weeks ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆384 Updated last year
- A tool for evaluating LLMs ☆424 Updated last year
- Data-Driven Evaluation for LLM-Powered Applications ☆507 Updated 9 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆151 Updated last year
- A simple Python sandbox for helpful LLM data agents ☆284 Updated last year
- Tutorial for building LLM router ☆231 Updated last year
- Deep Research for your internal data ☆341 Updated 4 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ☆336 Updated last month
- A curated list of open source repositories for AI Engineers ☆119 Updated 7 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆79 Updated 8 months ago
- Routing on Random Forest (RoRF) ☆214 Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon) ☆262 Updated last year
- Structured information extraction from documents ☆317 Updated last year
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a… ☆66 Updated this week
- The easiest, and fastest way to run AI-generated Python code safely ☆335 Updated 10 months ago
- Legacy project of an analytics platform for LLM-generated content ☆436 Updated 3 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆318 Updated 3 months ago
- Action library for AI Agent ☆224 Updated 6 months ago
- Task-based Agentic Framework using StrictJSON as the core ☆458 Updated 3 weeks ago
- An example of multi-agent orchestration with llama-index ☆432 Updated 8 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆135 Updated last month
- 🦜💯 Flex those feathers! ☆252 Updated last year
- Open-source AI copilot that lets you chat with your observability data and code 🧙‍♂️ ☆354 Updated 5 months ago
- AGI SDK ☆368 Updated last week
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs ☆288 Updated 3 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆397 Updated last year