Giskard-AI / giskard
🐢 Open-Source Evaluation & Testing for ML models & LLMs
☆3,917Updated this week
Related projects: ⓘ
- AI Observability & Evaluation☆3,465Updated this week
- Adding guardrails to large language models.☆3,873Updated this week
- Evaluation and Tracking for LLM Experiments☆2,050Updated this week
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge…☆4,239Updated this week
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆2,627Updated last month
- AdalFlow: The library to build & auto-optimize any LLM tasks.☆1,325Updated this week
- structured outputs for llms☆7,529Updated this week
- Build Conversational AI in minutes ⚡️☆6,787Updated this week
- Harness LLMs with Multi-Agent Programming☆2,293Updated last week
- Seamlessly integrate LLMs into scikit-learn.☆3,251Updated 2 weeks ago
- An awesome & curated list of best LLMOps tools for developers☆3,730Updated 2 weeks ago
- 🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring sa…☆816Updated last month
- A language for constraint-guided and efficient LLM programming.☆3,615Updated 3 months ago
- The LLM Evaluation Framework☆2,981Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆3,797Updated this week
- ZenML 🙏: The bridge between ML and Ops. https://zenml.io.☆3,936Updated this week
- Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines☆6,591Updated this week
- Interactively explore unstructured datasets from your dataframe.☆1,101Updated last month
- dstack is an open-source alternative to Kubernetes, designed to simplify development, training, and deployment of AI across any cloud or …☆1,320Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆2,817Updated 2 weeks ago
- Structured Text Generation☆8,241Updated this week
- A curated list of awesome MLOps tools☆3,962Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—foundation models☆16,773Updated this week
- Open-source end-to-end LLM Development Platform☆957Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam…☆5,639Updated this week
- Go ahead and axolotl questions☆7,554Updated this week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.☆3,988Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆8,484Updated this week
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,567Updated this week
- Build resilient language agents as graphs.☆5,662Updated this week