relari-ai / continuous-eval
Data-Driven Evaluation for LLM-Powered Applications
☆487Updated 2 months ago
Alternatives and similar repositories for continuous-eval:
Users that are interested in continuous-eval are comparing it to the libraries listed below
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆482Updated 11 months ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud.☆320Updated 10 months ago
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, …☆427Updated 3 weeks ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆854Updated last year
- Python SDK for running evaluations on LLM generated responses☆276Updated last week
- LLM fine-tuning and eval☆346Updated last year
- Fine-tuning and serving LLMs on any cloud☆89Updated last year
- Prompt engineering, automated.☆299Updated 3 weeks ago
- Action library for AI Agent☆212Updated 2 weeks ago
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,564Updated 2 weeks ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆234Updated last week
- A simple Python sandbox for helpful LLM data agents☆247Updated 9 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆293Updated 5 months ago
- PostgreSQL vector database extension for building AI applications☆840Updated 4 months ago
- HumanLayer enables AI agents to communicate with humans in tool-based and async workflows. Guarantee human oversight of high-stakes funct…☆677Updated 2 weeks ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆367Updated 11 months ago
- VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y…☆691Updated 10 months ago
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone☆1,010Updated 5 months ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆292Updated 3 months ago
- ⛓️ build cognitive systems, pythonic☆333Updated 4 months ago
- A tool for evaluating LLMs☆413Updated 11 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆236Updated 6 months ago
- Agent accuracy measurements for LLMs☆205Updated 10 months ago
- Enforce structured output from LLMs 100% of the time☆249Updated 8 months ago
- An LLM-powered advanced RAG pipeline built from scratch☆831Updated last year
- High-performance retrieval engine for unstructured data☆1,316Updated this week
- An Awesome list of curated DSPy resources.☆305Updated last month
- pykoi: Active learning in one unified interface☆410Updated last year
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆395Updated last year
- A realtime serving engine for Data-Intensive Generative AI Applications☆987Updated this week