confident-ai / deepeval
The LLM Evaluation Framework
☆8,370 Updated this week
Alternatives and similar repositories for deepeval
Users who are interested in deepeval are comparing it to the libraries listed below.
- Supercharge Your LLM Application Evaluations 🚀 ☆9,535 Updated this week
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge… ☆7,222 Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,634 Updated last month
- An awesome & curated list of best LLMOps tools for developers ☆4,981 Updated last month
- Adding guardrails to large language models. ☆5,104 Updated 2 weeks ago
- Build resilient language agents as graphs. ☆14,228 Updated last week
- Agent Framework / shim to use Pydantic with LLMs ☆10,271 Updated this week
- AI Observability & Evaluation ☆5,972 Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open… ☆12,700 Updated this week
- Harness LLMs with Multi-Agent Programming ☆3,402 Updated this week
- Evaluation and Tracking for LLM Experiments ☆2,570 Updated this week
- structured outputs for llms ☆10,747 Updated last week
- DSPy: The framework for programming—not prompting—language models ☆25,466 Updated this week
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place. ☆2,831 Updated last week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. ☆4,806 Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean… ☆11,525 Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ☆6,041 Updated 2 weeks ago
- A collection of examples that show how to use CrewAI framework to automate workflows. ☆4,312 Updated last week
- Knowledge Agents and Management in the Cloud ☆4,014 Updated this week
- Build effective agents using Model Context Protocol and simple workflow patterns ☆5,817 Updated this week
- Build Conversational AI in minutes ⚡️ ☆9,967 Updated 2 weeks ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…