Giskard-AI / giskardLinks

🐢 Open-Source Evaluation & Testing for AI & LLM systems

☆4,651

Alternatives and similar repositories for giskard

Users that are interested in giskard are comparing it to the libraries listed below

Sorting:

guardrails-ai / guardrails
Adding guardrails to large language models.
☆5,171Updated 3 weeks ago
whylabs / langkit
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring sa…
☆924Updated 7 months ago
NVIDIA / NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
☆4,837Updated this week
truera / trulens
Evaluation and Tracking for LLM Experiments and AI Agents
☆2,586Updated this week
Mirascope / mirascope
LLM abstractions that aren't obstructions
☆1,191Updated this week
AnswerDotAI / RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,531Updated last month
protectai / rebuff
LLM Prompt Injection Detector
☆1,306Updated 10 months ago
hegelai / prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…
☆2,887Updated 10 months ago
promptfoo / promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge…
☆7,366Updated this week
protectai / llm-guard
The Security Toolkit for LLM Interactions
☆1,781Updated 2 weeks ago
JohnSnowLabs / langtest
Deliver safe & effective language models
☆526Updated this week
UKGovernmentBEIS / inspect_ai
Inspect: A framework for large language model evaluations
☆1,096Updated this week
dstackai / dstack
dstack is an open-source alternative to Kubernetes and Slurm, designed to simplify GPU allocation and AI workload orchestration for ML te…
☆1,812Updated this week
qdrant / fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
☆2,163Updated this week
ray-project / llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
☆1,798Updated 10 months ago
Renumics / spotlight
Interactively explore unstructured datasets from your dataframe.
☆1,183Updated 2 weeks ago
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,235Updated 2 months ago
confident-ai / deepeval
The LLM Evaluation Framework
☆8,464Updated this week
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,028Updated last month
langchain-ai / streamlit-agent
Reference implementations of several LangChain agents as Streamlit apps
☆1,495Updated 10 months ago
Giskard-AI / giskard-vision
📸 Open-Source Evaluation & Testing for Computer Vision AI systems
☆29Updated 8 months ago
aurelio-labs / semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
☆2,651Updated this week
deepchecks / deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…
☆3,825Updated 3 weeks ago
philschmid / easyllm
☆457Updated last year
explosion / spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
☆1,267Updated 5 months ago
Giskard-AI / awesome-ai-safety
📚 A curated list of papers & technical articles on AI Quality & Safety
☆184Updated 2 months ago
mistralai / cookbook
☆1,843Updated this week
stanford-crfm / helm
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models …
☆2,301Updated this week
ianarawjo / ChainForge
An open-source visual programming environment for battle-testing prompts to LLMs.
☆2,655Updated last month
huggingface / setfit
Efficient few-shot learning with Sentence Transformers
☆2,512Updated 2 months ago