confident-ai / deepeval
The LLM Evaluation Framework
☆3,541 Updated this week
Related projects
Alternatives and complementary repositories for deepeval
- Supercharge Your LLM Application Evaluations 🚀 ☆7,138 Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,031 Updated 2 months ago
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,077 Updated last week
- Evaluation and Tracking for LLM Experiments ☆2,141 Updated this week
- Harness LLMs with Multi-Agent Programming ☆2,611 Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! ☆3,207 Updated 2 months ago
- [EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,601 Updated 2 months ago
- Adding guardrails to large language models. ☆4,053 Updated this week
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,712 Updated 3 months ago
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam… ☆6,302 Updated this week
- Build resilient language agents as graphs. ☆6,531 Updated this week
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro… ☆2,700 Updated 2 months ago
- The all-in-one solution for RAG. Build, scale, and deploy state of the art Retrieval-Augmented Generation applications ☆3,553 Updated this week
- Developer APIs to Accelerate LLM Projects ☆1,419 Updated 3 weeks ago
- ☆2,732 Updated last month
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge… ☆4,650 Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,178 Updated this week
- DSPy: The framework for programming—not prompting—foundation models ☆18,587 Updated this week
- AI Observability & Evaluation ☆3,865 Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆9,032 Updated this week
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆1,514 Updated 3 weeks ago
- AdalFlow: The library to build & auto-optimize LLM applications. ☆1,941 Updated this week
- structured outputs for llms ☆8,068 Updated this week
- Parse files for optimal RAG ☆3,019 Updated this week
- A blazing fast inference solution for text embeddings models ☆2,806 Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆1,609 Updated this week
- ☆3,944 Updated 7 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆13,615 Updated this week
- Tools for merging pretrained large language models. ☆4,788 Updated this week
- An LLM-powered advanced RAG pipeline built from scratch ☆796 Updated 9 months ago