athina-ai / ariadne

LLM Evals for Text Summarization and RAG use-cases.

☆35

Alternatives and similar repositories for ariadne

Users who are interested in ariadne are comparing it to the libraries listed below.

athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆286 Updated 2 weeks ago
BerriAI / bettertest
☆75 Updated last year
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆150 Updated 8 months ago
fw-ai / cookbook
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
☆113 Updated this week
braintrustdata / braintrust-cookbook
☆33 Updated 3 weeks ago
redotvideo / pluto
Synthetic Data for LLM Fine-Tuning
☆120 Updated last year
Trainy-ai / llm-atc
Fine-tuning and serving LLMs on any cloud
☆90 Updated last year
parlance-labs / langfree
Leverage your LangChain trace data for fine tuning
☆41 Updated 10 months ago
fadynakhla / dr-claude
Anthropic Claude2 Hackathon: Building MCTS with Claude for optimal action prediction during patient/doctor interactions.
☆105 Updated last year
CyrusNuevoDia / llegos
A strongly typed Python DSL for developing message-passing multi-agent systems
☆53 Updated last year
instructor-ai / evals
Using various instructor clients to evaluate the quality and capabilities of extractions and reasoning.
☆52 Updated 8 months ago
jxnl / n-levels-of-rag
☆195 Updated last year
interlocklabs / trellis
A simple DAG for executing LLM calls and using tools.
☆41 Updated last year
villagecomputing / superopenai
Logging and caching superpowers for the openai sdk
☆105 Updated last year
eugeneyan / align-app
☆72 Updated 7 months ago
parea-ai / parea-sdk-py
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
☆78 Updated 4 months ago
andrewnguonly / ChatAbstractions
LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!
☆81 Updated last year
NirantK / agentai
Text to Python Objects via a LLM Function Call
☆58 Updated last year
villagecomputing / superpipe
Superpipe - optimized LLM pipelines for structured data
☆110 Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated 10 months ago
getmetal / Metal
The AI-first datastore & retrieval engine.
☆34 Updated 7 months ago
PrithivirajDamodaran / Route0x
Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da
☆105 Updated 2 months ago
stunningpixels / lou-eval
Track the progress of LLM context utilisation
☆54 Updated 2 months ago
seanchatmangpt / dspygen
A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.
☆126 Updated 8 months ago
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆99 Updated last year
cohere-ai / sandbox-grounded-qa
A sandbox repo for grounded question answering with Cohere and Google Search
☆136 Updated last year
AI-Northstar-Tech / vector-io
Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…
☆244 Updated 2 weeks ago
sugarcane-ai / sugarcane-ai
npm-like package ecosystem for Prompts 🤖
☆49 Updated 4 months ago
hdresearch / nolita
Work with web-enabled agents quickly — whether running a quick task or bootstrapping a full-stack product.
☆93 Updated 7 months ago
davanstrien / data-for-fine-tuning-llms
☆77 Updated last year