relari-ai / continuous-evalLinks
Data-Driven Evaluation for LLM-Powered Applications
☆506Updated 8 months ago
Alternatives and similar repositories for continuous-eval
Users that are interested in continuous-eval are comparing it to the libraries listed below
Sorting:
- Legacy project☆438Updated 2 months ago
- Python SDK for running evaluations on LLM generated responses☆292Updated 3 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆861Updated last year
- Prompt engineering, automated.☆343Updated 5 months ago
- Fine-tuning and serving LLMs on any cloud☆90Updated last year
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆260Updated last week
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆508Updated last year
- VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y…☆696Updated last year
- Action library for AI Agent☆224Updated 6 months ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud.☆331Updated last year
- A tool for evaluating LLMs☆424Updated last year
- Agent accuracy measurements for LLMs☆204Updated last year
- LLM fine-tuning and eval☆346Updated last year
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone☆1,024Updated 10 months ago
- ☆746Updated last year
- An LLM-powered advanced RAG pipeline built from scratch☆853Updated last year
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆318Updated 2 months ago
- Self-hardening firewall for large language models☆265Updated last year
- 🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.☆562Updated this week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,799Updated last week
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆387Updated last year
- Enforce structured output from LLMs 100% of the time☆250Updated last year
- A simple DAG for executing LLM calls and using tools.☆41Updated 2 years ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆312Updated 9 months ago
- Automatically reformat any JSON into any schema with AI☆335Updated 6 months ago
- A realtime serving engine for Data-Intensive Generative AI Applications☆1,054Updated this week
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆357Updated last week
- Build robust LLM applications with true composability 🔗☆419Updated last year
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆399Updated last year
- PostgreSQL vector database extension for building AI applications☆864Updated 9 months ago