Giskard-AI / giskard-ossView external linksLinks
π’ Open-Source Evaluation & Testing library for LLM Agents
β5,111Feb 6, 2026Updated last week
Alternatives and similar repositories for giskard-oss
Users that are interested in giskard-oss are comparing it to the libraries listed below
Sorting:
- AI Observability & Evaluationβ8,530Updated this week
- The LLM Evaluation Frameworkβ13,613Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,852Updated this week
- Supercharge Your LLM Application Evaluations πβ12,605Jan 31, 2026Updated 2 weeks ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,976Dec 28, 2025Updated last month
- Adding guardrails to large language models.β6,399Updated this week
- DSPy: The framework for programmingβnot promptingβlanguage modelsβ32,156Updated this week
- Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude,β¦β10,462Updated this week
- Structured Outputsβ13,403Feb 6, 2026Updated last week
- Evaluation and Tracking for LLM Experiments and AI Agentsβ3,082Updated this week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.β5,650Updated this week
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convertβ¦β24,162Updated this week
- structured outputs for llmsβ12,357Updated this week
- The Security Toolkit for LLM Interactionsβ2,537Dec 15, 2025Updated 2 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into cleanβ¦β13,973Updated this week
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β7,111Updated this week
- πͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Openβ¦β21,935Updated this week
- Build Conversational AI in minutes β‘οΈβ11,558Feb 3, 2026Updated last week
- ZenML π: One AI Platform from Pipelines to Agents. https://zenml.io.β5,202Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,558Jul 14, 2025Updated 7 months ago
- A guidance language for controlling large language models.β21,270Feb 6, 2026Updated last week
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data β¦β11,309Jan 13, 2026Updated last month
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,130Updated this week
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroβ¦β3,003Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.β13,155Feb 8, 2026Updated last week
- the LLM vulnerability scannerβ6,948Feb 5, 2026Updated last week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.β46,977Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing aβ¦β35,968Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,852May 17, 2025Updated 8 months ago
- The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engβ¦β3,408Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,717Feb 9, 2026Updated last week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.β21,024Jan 29, 2026Updated 2 weeks ago
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,β¦β17,744Updated this week
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, oβ¦β9,442Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ70,205Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.β12,102Updated this week
- A language for constraint-guided and efficient LLM programming.β4,148May 22, 2025Updated 8 months ago
- Universal memory layer for AI Agentsβ47,230Feb 3, 2026Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ3,718May 21, 2025Updated 8 months ago