relari-ai / continuous-evalLinks
Data-Driven Evaluation for LLM-Powered Applications
☆493Updated 4 months ago
Alternatives and similar repositories for continuous-eval
Users that are interested in continuous-eval are comparing it to the libraries listed below
Sorting:
- Prompt engineering, automated.☆321Updated last month
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆857Updated last year
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, …☆435Updated last week
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆486Updated last year
- LLM fine-tuning and eval☆346Updated last year
- Python SDK for running evaluations on LLM generated responses☆280Updated 2 weeks ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud.☆322Updated last year
- Fine-tuning and serving LLMs on any cloud☆89Updated last year
- A tool for evaluating LLMs☆419Updated last year
- VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y…☆693Updated last year
- Action library for AI Agent☆214Updated 2 months ago
- Agent accuracy measurements for LLMs☆204Updated 11 months ago
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,616Updated this week
- Self-hardening firewall for large language models☆265Updated last year
- A simple DAG for executing LLM calls and using tools.☆41Updated last year
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆295Updated 5 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆239Updated 7 months ago
- ☆744Updated last year
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆374Updated last year
- data cleaning and curation for unstructured text☆327Updated 9 months ago
- Complex LLM Workflows from Simple JSON.☆300Updated last year
- Synthetic Data for LLM Fine-Tuning☆116Updated last year
- ☆194Updated last year
- 🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.☆518Updated this week
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆242Updated last week
- Get 100% uptime, reliability from OpenAI. Handle Rate Limit, Timeout, API, Keys Errors☆652Updated last year
- Automatically reformat any JSON into any schema with AI☆330Updated 2 months ago
- Large language model evaluation and workflow framework from Phase AI.☆454Updated 4 months ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆319Updated last month
- An LLM-powered advanced RAG pipeline built from scratch☆840Updated last year