relari-ai / continuous-eval
Data-Driven Evaluation for LLM-Powered Applications
☆447 Updated 2 months ago
Related projects
Alternatives and complementary repositories for continuous-eval
- AgentSearch is a framework for powering search agents and enabling customizable local search. ☆444 Updated 6 months ago
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, … ☆377 Updated this week
- Python SDK for running evaluations on LLM generated responses ☆221 Updated this week
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆836 Updated 10 months ago
- Prompt engineering, automated. ☆246 Updated 2 weeks ago
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr… ☆1,268 Updated this week
- LLM fine-tuning and eval ☆341 Updated 8 months ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud. ☆289 Updated 6 months ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry ☆266 Updated last week
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆260 Updated this week
- OpenTelemetry Instrumentation for AI Observability ☆218 Updated this week
- High-performance retrieval engine for unstructured data ☆982 Updated last week
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆341 Updated 6 months ago
- Reliable LLM Memory for AI Applications and AI Agents ☆913 Updated this week
- A tool for evaluating LLMs ☆392 Updated 6 months ago
- Source available LLM Ops platform and LLM Optimization Studio powered by DSPy. ☆341 Updated this week
- Action library for AI Agent ☆191 Updated 2 weeks ago
- ☆727 Updated 7 months ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba… ☆219 Updated this week
- Fine-tuning and serving LLMs on any cloud ☆87 Updated 11 months ago
- Agent accuracy measurements for LLMs ☆203 Updated 5 months ago
- The first AI Agent Server, Eidolon is a pluggable Agent SDK and enterprise ready, deployment server for Agentic applications ☆286 Updated this week
- 🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data. ☆390 Updated this week
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆390 Updated 11 months ago
- A realtime serving engine for Data-Intensive Generative AI Applications ☆914 Updated this week
- Agents Capable of Self-Editing Their Prompts / Python Code ☆745 Updated 8 months ago
- Task-based Agentic Framework using StrictJSON as the core ☆436 Updated last month
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices. ☆248 Updated this week
- An LLM-powered advanced RAG pipeline built from scratch ☆798 Updated 9 months ago
- data cleaning and curation for unstructured text ☆327 Updated 3 months ago