confident-ai / deepeval
The LLM Evaluation Framework
☆3,834 · Updated this week
Alternatives and similar repositories for deepeval:
Users who are interested in deepeval are comparing it to the libraries listed below:
- Supercharge Your LLM Application Evaluations 🚀 ☆7,411 · Updated this week
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge… ☆4,881 · Updated this week
- Evaluation and Tracking for LLM Experiments ☆2,200 · Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,156 · Updated this week
- AI Observability & Evaluation ☆4,056 · Updated this week
- Adding guardrails to large language models. ☆4,185 · Updated this week
- Developer APIs to Accelerate LLM Projects ☆1,452 · Updated last month
- structured outputs for llms ☆8,390 · Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,085 · Updated 2 months ago
- Parse files for optimal RAG ☆3,284 · Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,686 · Updated 2 weeks ago
- TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients. ☆1,871 · Updated last week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam… ☆6,823 · Updated this week
- Harness LLMs with Multi-Agent Programming ☆2,728 · Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! ☆3,303 · Updated 3 months ago
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. ☆4,231 · Updated this week
- DSPy: The framework for programming—not prompting—language models ☆19,401 · Updated this week
- Build resilient language agents as graphs. ☆6,953 · Updated this week
- Go ahead and axolotl questions ☆8,022 · Updated this week
- Deploy your agentic workflows to production ☆1,857 · Updated this week
- Structured Text Generation ☆9,856 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆1,687 · Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications. ☆2,238 · Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,475 · Updated 9 months ago
- An awesome & curated list of best LLMOps tools for developers ☆4,065 · Updated this week
- LangServe 🦜️🏓 ☆1,961 · Updated last week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr… ☆1,309 · Updated this week
- Open-source tool to visualise your RAG 🔮 ☆1,089 · Updated 8 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,122 · Updated 3 weeks ago
- ☆2,772 · Updated 2 months ago