Giskard-AI/giskard-oss

🐢 Open-Source Evaluation & Testing library for LLM Agents

☆5,722

Alternatives and similar repositories for giskard-oss

Users interested in giskard-oss are comparing it with the libraries listed below.

Arize-ai / phoenix
AI Observability & Evaluation
☆10,781 • Updated this week
argilla-io / argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆5,060 • Updated this week
deepchecks / deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…
☆4,041 • Dec 28, 2025 • Updated 7 months ago
guardrails-ai / guardrails
Adding guardrails to large language models.
☆7,217 • Updated this week
vibrantlabsai / ragas
Supercharge Your LLM Application Evaluations 🚀
☆15,016 • Feb 24, 2026 • Updated 5 months ago
confident-ai / deepeval
The LLM Evaluation Framework
☆17,232 • Updated this week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆36,434 • Updated this week
NVIDIA-NeMo / Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
☆6,818 • Updated this week
dottxt-ai / outlines
Structured Outputs
☆15,419 • Updated this week
truera / trulens
Evaluation and Tracking for LLM Experiments and AI Agents
☆3,469 • Updated this week
evidentlyai / evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…
☆7,759 • May 2, 2026 • Updated 2 months ago
NVIDIA / garak
the LLM vulnerability scanner
☆8,609 • Updated this week
deepset-ai / haystack
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…
☆26,043 • Updated this week
microsoft / PyRIT
The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and eng…
☆4,200 • Updated this week
guidance-ai / guidance
A guidance language for controlling large language models.
☆21,694 • May 21, 2026 • Updated 2 months ago
567-labs / instructor
structured outputs for llms
☆13,642 • Jul 13, 2026 • Updated 2 weeks ago
protectai / llm-guard
The Security Toolkit for LLM Interactions
☆3,201 • Jul 8, 2026 • Updated 2 weeks ago
Unstructured-IO / unstructured
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean…
☆15,210 • Updated this week
Chainlit / chainlit
Build Conversational AI in minutes ⚡️
☆12,340 • Jun 11, 2026 • Updated last month
weaviate / Verba
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
☆7,714 • Jun 8, 2026 • Updated last month
cleanlab / cleanlab
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data …
☆11,604 • Jan 13, 2026 • Updated 6 months ago
run-llama / llama_index
LlamaIndex is the leading document agent and OCR platform
☆51,165 • Updated this week
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆13,594 • Jul 20, 2026 • Updated last week
zenml-io / zenml
ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.
☆5,525 • Updated this week
hegelai / prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…
☆3,046 • Feb 11, 2026 • Updated 5 months ago
neuml / txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,761 • Updated this week
AnswerDotAI / RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,944 • May 17, 2025 • Updated last year
addy-ai / langdrive
Train LLMs on private data. Simply make an API request to our training endpoint specifying your data and model. LangDrive will handle the …
☆175 • Aug 9, 2024 • Updated last year
ShishirPatil / gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,968 • Apr 13, 2026 • Updated 3 months ago
promptfoo / promptfoo
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, De…
☆23,657 • Updated this week
langfuse / langfuse
🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenT…
☆32,026 • Updated this week
microsoft / autogen
A programming framework for agentic AI
☆60,063 • Apr 15, 2026 • Updated 3 months ago
whylabs / langkit
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring sa…
☆994 • Nov 22, 2024 • Updated last year
bentoml / OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
☆12,420 • Updated this week
SylphAI-Inc / AdalFlow
AdalFlow: The library to build & auto-optimize LLM applications.
☆4,188 • May 29, 2026 • Updated 2 months ago
eth-sri / lmql
A language for constraint-guided and efficient LLM programming.
☆4,203 • May 22, 2025 • Updated last year
tensorchord / Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
☆5,899 • May 21, 2026 • Updated 2 months ago
skypilot-org / skypilot
The AI Compute Platform for frontier teams. SkyPilot turns fragmented AI compute into one AI supercomputer, so frontier AI teams build cu…
☆10,418 • Updated this week
protectai / rebuff
LLM Prompt Injection Detector
☆1,515 • Aug 7, 2024 • Updated last year