Giskard-AI / giskardLinks
π’ Open-Source Evaluation & Testing for AI & LLM systems
β4,651Updated last week
Alternatives and similar repositories for giskard
Users that are interested in giskard are comparing it to the libraries listed below
Sorting:
- Adding guardrails to large language models.β5,171Updated 3 weeks ago
- π LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). π Extracts signals from prompts & responses, ensuring saβ¦β924Updated 7 months ago
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.β4,837Updated this week
- Evaluation and Tracking for LLM Experiments and AI Agentsβ2,586Updated this week
- LLM abstractions that aren't obstructionsβ1,191Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,531Updated last month
- LLM Prompt Injection Detectorβ1,306Updated 10 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroβ¦β2,887Updated 10 months ago
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Geβ¦β7,366Updated this week
- The Security Toolkit for LLM Interactionsβ1,781Updated 2 weeks ago
- Deliver safe & effective language modelsβ526Updated this week
- Inspect: A framework for large language model evaluationsβ1,096Updated this week
- dstack is an open-source alternative to Kubernetes and Slurm, designed to simplify GPU allocation and AI workload orchestration for ML teβ¦β1,812Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embeddingβ2,163Updated this week
- A comprehensive guide to building RAG-based LLM applications for production.β1,798Updated 10 months ago
- Interactively explore unstructured datasets from your dataframe.β1,183Updated 2 weeks ago
- Robust recipes to align language models with human and AI preferencesβ5,235Updated 2 months ago
- The LLM Evaluation Frameworkβ8,464Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ3,028Updated last month
- Reference implementations of several LangChain agents as Streamlit appsβ1,495Updated 10 months ago
- πΈ Open-Source Evaluation & Testing for Computer Vision AI systemsβ29Updated 8 months ago
- Superfast AI decision making and intelligent processing of multi-modal data.β2,651Updated this week
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,825Updated 3 weeks ago
- β457Updated last year
- π¦ Integrating LLMs into structured NLP pipelinesβ1,267Updated 5 months ago
- π A curated list of papers & technical articles on AI Quality & Safetyβ184Updated 2 months ago
- β1,843Updated this week
- Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models β¦β2,301Updated this week
- An open-source visual programming environment for battle-testing prompts to LLMs.β2,655Updated last month
- Efficient few-shot learning with Sentence Transformersβ2,512Updated 2 months ago