confident-ai / deepeval
The LLM Evaluation Framework
★4,385 · Updated this week
Alternatives and similar repositories for deepeval:
Users who are interested in deepeval compare it to the libraries listed below.
- Supercharge Your LLM Application Evaluations ★7,889 · Updated this week
- Parse files for optimal RAG ★3,526 · Updated last week
- Superfast AI decision making and intelligent processing of multi-modal data. ★2,294 · Updated this week
- AI Observability & Evaluation ★4,483 · Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ★3,442 · Updated 5 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ★3,172 · Updated 4 months ago
- Evaluation and Tracking for LLM Experiments ★2,277 · Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ★4,812 · Updated last month
- Adding guardrails to large language models. ★4,362 · Updated this week
- structured outputs for llms ★8,909 · Updated this week
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge… ★5,192 · Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications. ★2,474 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ★1,879 · Updated this week
- Harness LLMs with Multi-Agent Programming ★2,917 · Updated this week
- Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam… ★7,839 · Updated this week
- An awesome & curated list of best LLMOps tools for developers ★4,247 · Updated 3 weeks ago
- LangServe ★1,993 · Updated 3 weeks ago
- Structured Text Generation ★10,350 · Updated this week
- Enforce the output format (JSON Schema, Regex etc) of a language model ★1,666 · Updated 3 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro… ★2,760 · Updated 5 months ago
- Build resilient language agents as graphs. ★8,082 · Updated this week
- Efficient Retrieval Augmentation and Generation Framework ★1,419 · Updated last week
- MTEB: Massive Text Embedding Benchmark ★2,086 · Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ★3,518 · Updated 2 weeks ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ★9,785 · Updated this week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. ★4,318 · Updated this week
- A blazing fast inference solution for text embeddings models ★3,043 · Updated last week
- Go ahead and axolotl questions ★8,293 · Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ★2,311 · Updated this week
- Tools for merging pretrained large language models. ★5,113 · Updated last week