confident-ai / deepevalLinks

The LLM Evaluation Framework

☆9,277

Alternatives and similar repositories for deepeval

Users that are interested in deepeval are comparing it to the libraries listed below

Sorting:

explodinggradients / ragas
Supercharge Your LLM Application Evaluations 🚀
☆9,940Updated this week
SylphAI-Inc / AdalFlow
AdalFlow: The library to build & auto-optimize LLM applications.
☆3,447Updated this week
run-llama / llama_cloud_services
Knowledge Agents and Management in the Cloud
☆4,066Updated this week
promptfoo / promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge…
☆7,567Updated this week
guardrails-ai / guardrails
Adding guardrails to large language models.
☆5,302Updated this week
Arize-ai / phoenix
AI Observability & Evaluation
☆6,372Updated this week
Marker-Inc-Korea / AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
☆4,108Updated 2 weeks ago
langroid / langroid
Harness LLMs with Multi-Agent Programming
☆3,487Updated this week
truera / trulens
Evaluation and Tracking for LLM Experiments and AI Agents
☆2,647Updated this week
pydantic / pydantic-ai
Agent Framework / shim to use Pydantic with LLMs
☆11,161Updated this week
MadcowD / ell
A language model programming library.
☆5,801Updated last month
aurelio-labs / semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
☆2,681Updated this week
567-labs / instructor
structured outputs for llms
☆11,016Updated this week
langchain-ai / langgraph
Build resilient language agents as graphs.
☆15,948Updated this week
awslabs / agent-squad
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
☆6,271Updated 3 weeks ago
lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,115Updated 11 months ago
circlemind-ai / fast-graphrag
RAG that intelligently adapts to your use case, data, and queries
☆3,382Updated last month
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆26,530Updated this week
dottxt-ai / outlines
Structured Outputs
☆12,120Updated this week
langchain-ai / langgraph-studio
Desktop app for prototyping and debugging LangGraph applications locally.
☆3,074Updated 3 weeks ago
AnswerDotAI / RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,581Updated 2 months ago
microsoft / LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…
☆5,272Updated 4 months ago
Unstructured-IO / unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…
☆11,934Updated this week
langfuse / langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…
☆13,751Updated this week
tensorchord / Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
☆5,100Updated 3 weeks ago
microsoft / PromptWizard
Task-Aware Agent-driven Prompt Optimization Framework
☆3,404Updated last week
BrainBlend-AI / atomic-agents
Building AI agents, atomically
☆4,391Updated this week
BoundaryML / baml
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
☆4,729Updated this week
ag2ai / ag2
AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ
☆3,022Updated this week
Agenta-AI / agenta
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
☆2,946Updated this week