relari-ai / continuous-eval
Data-Driven Evaluation for LLM-Powered Applications
☆484 Updated 2 months ago
Alternatives and similar repositories for continuous-eval:
Users who are interested in continuous-eval are comparing it to the libraries listed below.
- AgentSearch is a framework for powering search agents and enabling customizable local search. ☆479 Updated 11 months ago
- Python SDK for running evaluations on LLM generated responses ☆272 Updated this week
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆853 Updated last year
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, … ☆425 Updated 2 months ago
- Prompt engineering, automated. ☆288 Updated 4 months ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud. ☆320 Updated 10 months ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba… ☆230 Updated this week
- LLM fine-tuning and eval ☆344 Updated last year
- VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y… ☆689 Updated 10 months ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆363 Updated 10 months ago
- Action library for AI Agent ☆211 Updated this week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr… ☆1,542 Updated this week
- Agent accuracy measurements for LLMs ☆205 Updated 9 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆148 Updated 5 months ago
- 🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and mo… ☆1,016 Updated 2 months ago
- A tool for evaluating LLMs ☆407 Updated 10 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆288 Updated 4 months ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌 ☆289 Updated 2 months ago
- Fine-tuning and serving LLMs on any cloud ☆89 Updated last year
- Complex LLM Workflows from Simple JSON. ☆293 Updated last year
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆235 Updated 5 months ago
- Enforce structured output from LLMs 100% of the time ☆248 Updated 8 months ago
- A simple Python sandbox for helpful LLM data agents ☆235 Updated 9 months ago
- HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes funct… ☆649 Updated this week
- Task-based Agentic Framework using StrictJSON as the core ☆448 Updated last month
- Agents Capable of Self-Editing Their Prompts / Python Code ☆759 Updated last year
- ⛓️ build cognitive systems, pythonic ☆331 Updated 4 months ago
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone ☆1,007 Updated 4 months ago
- data cleaning and curation for unstructured text ☆329 Updated 7 months ago
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility. ☆914 Updated 2 months ago