relari-ai / continuous-eval
Data-Driven Evaluation for LLM-Powered Applications
☆513 Updated 10 months ago
Alternatives and similar repositories for continuous-eval
Users who are interested in continuous-eval are comparing it to the libraries listed below.
- Legacy project of an analytics platform for LLM-generated content ☆437 Updated 4 months ago
- Prompt engineering, automated. ☆349 Updated 7 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆864 Updated last year
- Python SDK for running evaluations on LLM generated responses ☆293 Updated 5 months ago
- AgentSearch is a framework for powering search agents and enabling customizable local search. ☆515 Updated last year
- Fine-tuning and serving LLMs on any cloud ☆90 Updated 2 years ago
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba… ☆264 Updated last week
- VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y… ☆695 Updated last year
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud. ☆333 Updated last year
- An LLM-powered advanced RAG pipeline built from scratch ☆854 Updated last year
- LLM fine-tuning and eval ☆344 Updated last year
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆319 Updated 4 months ago
- ☆747 Updated last year
- Action library for AI Agent ☆229 Updated 8 months ago
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone ☆1,023 Updated last year
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario… ☆1,035 Updated 9 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆398 Updated 2 years ago
- Agent accuracy measurements for LLMs ☆204 Updated last year
- A tool for evaluating LLMs ☆428 Updated last year
- A realtime serving engine for Data-Intensive Generative AI Applications ☆1,074 Updated this week
- 🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data. ☆579 Updated this week
- Enforce structured output from LLMs 100% of the time ☆248 Updated last year
- Logging and caching superpowers for the openai sdk ☆105 Updated last year
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆247 Updated last year
- Complex LLM Workflows from Simple JSON. ☆316 Updated 2 years ago
- Open source AI Agent evaluation framework for web tasks 🐒🍌 ☆327 Updated 11 months ago
- Get 100% uptime, reliability from OpenAI. Handle Rate Limit, Timeout, API, Keys Errors ☆690 Updated 2 years ago
- PostgreSQL vector database extension for building AI applications ☆869 Updated 11 months ago
- Prompt engineering for developers ☆693 Updated last year
- A simple Python sandbox for helpful LLM data agents ☆294 Updated last year