athina-ai / ariadne
LLM Evals for Text Summarization and RAG use-cases.
☆35 Updated last year
Alternatives and similar repositories for ariadne:
Users who are interested in ariadne are comparing it to the libraries listed below.
- Synthetic Data for LLM Fine-Tuning ☆113 Updated last year
- Logging and caching superpowers for the openai sdk ☆104 Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama. ☆122 Updated 5 months ago
- Leverage your LangChain trace data for fine tuning ☆41 Updated 8 months ago
- Python SDK for running evaluations on LLM generated responses ☆276 Updated last week
- Anthropic Claude2 Hackathon: Building MCTS with Claude for optimal action prediction during patient/doctor interactions. ☆104 Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆149 Updated 6 months ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba… ☆234 Updated last week
- Repository of the code base for KT Generation process that we worked on at Google Cloud and Searce GenAI Hackathon. ☆74 Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems ☆52 Updated last year
- Demo of ConversationEntityMemory in Streamlit. ☆52 Updated 2 years ago
- ☆75 Updated last year
- Fine-tuning and serving LLMs on any cloud ☆89 Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆50 Updated 6 months ago
- ☆57 Updated last year
- Prompt engineering, automated. ☆299 Updated 3 weeks ago
- Fluid Database ☆114 Updated 6 months ago
- Python client library for improving your LLM app accuracy ☆97 Updated 2 months ago
- Track the progress of LLM context utilisation ☆54 Updated 8 months ago
- LLMON (pronounced limón) is a structured data format optimized for large language models ☆34 Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 Updated last year
- Superpipe - optimized LLM pipelines for structured data ☆108 Updated 9 months ago
- ☆27 Updated 9 months ago
- ☆194 Updated 11 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a… ☆49 Updated 2 weeks ago
- Verdict is a library for scaling judge-time compute. ☆195 Updated 3 weeks ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆103 Updated 4 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆76 Updated 2 months ago
- Hosted embedding platform to discover, evaluate, and retrieve embeddings ☆73 Updated last year