athina-ai / athina-evals
Python SDK for running evaluations on LLM-generated responses
☆196 Updated this week
Related projects:
- Prompt engineering, automated. ☆201 Updated this week
- The only Vector tooling you'll need. Star the repo and look out for an email to try out a brand new Vector Data Exploration demo! Use the… ☆195 Updated this week
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev… ☆425 Updated last week
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆335 Updated 4 months ago
- An AGentic Intelligence Operating System ☆282 Updated this week
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, … ☆352 Updated this week
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆72 Updated last week
- Data-Driven Evaluation for LLM-Powered Applications ☆432 Updated 2 weeks ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆208 Updated last month
- 🦜💯 Flex those feathers! ☆227 Updated last month
- FastAPI wrapper around DSPy ☆201 Updated 6 months ago
- ☆172 Updated 4 months ago
- AgentSearch is a framework for powering search agents and enabling customizable local search. ☆432 Updated 4 months ago
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices. ☆150 Updated this week
- The first AI Agent Server, Eidolon is a pluggable Agent SDK and enterprise ready, deployment server for Agentic applications ☆218 Updated this week
- LLM fine-tuning and eval ☆340 Updated 5 months ago
- A simple Python sandbox for helpful LLM data agents ☆143 Updated 3 months ago
- Action library for AI Agent ☆187 Updated this week
- Task-based Agentic Framework using StrictJSON as the core ☆334 Updated last week
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon) ☆259 Updated 6 months ago
- ☆219 Updated 10 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆387 Updated 9 months ago
- A tool for evaluating LLMs ☆377 Updated 4 months ago
- Python client library for improving your LLM app accuracy ☆94 Updated this week
- Open-source RAG evaluation through users' feedback ☆155 Updated 5 months ago
- Tutorial for building LLM router ☆145 Updated 2 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d… ☆135 Updated 5 months ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud. ☆282 Updated 3 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆79 Updated 7 months ago
- An Awesome list of curated DSPy resources. ☆192 Updated last week