athina-ai / ariadne
LLM Evals for Text Summarization and RAG use-cases.
☆35Updated last year
Alternatives and similar repositories for ariadne:
Users that are interested in ariadne are comparing it to the libraries listed below
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- Logging and caching superpowers for the openai sdk☆105Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Updated 7 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆76Updated 2 months ago
- Synthetic Data for LLM Fine-Tuning☆115Updated last year
- ☆75Updated last year
- Fiddler Auditor is a tool to evaluate language models.☆179Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems☆52Updated last year
- ☆195Updated last year
- ☆57Updated last year
- Leverage your LangChain trace data for fine tuning☆41Updated 9 months ago
- A collection of LLM services you can self host via docker or modal labs to support your applications development☆185Updated last year
- ☆29Updated 10 months ago
- Prompt engineering, automated.☆304Updated last week
- Python SDK for running evaluations on LLM generated responses☆278Updated last week
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆237Updated last week
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.☆104Updated last year
- Fluid Database☆114Updated 7 months ago
- A simple DAG for executing LLM calls and using tools.☆41Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆74Updated last year
- data cleaning and curation for unstructured text☆329Updated 9 months ago
- Text to Python Objects via a LLM Function Call☆57Updated last year
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆175Updated last year
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- Data-Driven Evaluation for LLM-Powered Applications☆489Updated 3 months ago
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆98Updated last year
- ☆91Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆144Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆103Updated last year
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆64Updated last week