athina-ai / ariadne
LLM Evals for Text Summarization and RAG use-cases.
☆35Updated 7 months ago
Related projects: ⓘ
- Logging and caching superpowers for the openai sdk☆98Updated 6 months ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆91Updated 2 months ago
- ☆172Updated 4 months ago
- Python SDK for running evaluations on LLM generated responses☆196Updated this week
- Synthetic Data for LLM Fine-Tuning☆78Updated 9 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆72Updated last week
- ☆75Updated 7 months ago
- The only Vector tooling you'll need. Star the repo and look out for an email to try out a brand new Vector Data Exploration demo! Use the…☆195Updated this week
- ☆56Updated last week
- A strongly typed Python DSL for developing message passing multi agent systems☆50Updated 5 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆45Updated last week
- ☆58Updated 3 weeks ago
- Fine-tuning and serving LLMs on any cloud☆85Updated 9 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆93Updated 5 months ago
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.☆150Updated this week
- Prompt engineering, automated.☆201Updated this week
- FastAPI wrapper around DSPy☆201Updated 6 months ago
- ☆47Updated this week
- A simple Python sandbox for helpful LLM data agents☆143Updated 3 months ago
- ☆20Updated 3 months ago
- A simple DAG for executing LLM calls and using tools.☆37Updated last year
- ☆95Updated this week
- GPT-based Conversation Summarizer☆144Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆136Updated 2 weeks ago
- ⛓️ build cognitive systems, pythonic☆321Updated 2 months ago
- Anthropic Claude2 Hackathon:Building MCTS with Claude for optimal action prediction during patient/doctor interactions.☆107Updated last year
- Automating enterprise workflows with multimodal agents☆83Updated last month
- AI For Software Operations☆81Updated this week
- ☆57Updated last year
- Simple Graph Memory for AI applications☆76Updated last month