Giskard-AI / giskard-ossLinks
π’ Open-Source Evaluation & Testing library for LLM Agents
β4,950Updated last week
Alternatives and similar repositories for giskard-oss
Users that are interested in giskard-oss are comparing it to the libraries listed below
Sorting:
- Adding guardrails to large language models.β5,842Updated this week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.β5,177Updated this week
- π LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). π Extracts signals from prompts & responses, ensuring saβ¦β951Updated 11 months ago
- The Security Toolkit for LLM Interactionsβ2,193Updated this week
- An awesome & curated list of best LLMOps tools for developersβ5,374Updated last week
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,927Updated 2 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,729Updated last week
- LLM Prompt Injection Detectorβ1,362Updated last year
- Evaluation and Tracking for LLM Experiments and AI Agentsβ2,862Updated last week
- AdalFlow: The library to build & auto-optimize LLM applications.β3,840Updated 3 weeks ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroβ¦β2,944Updated last year
- ZenML π: MLOps for Reliable AI: from Classical ML to Agents. https://zenml.io.β4,963Updated this week
- AI Observability & Evaluationβ7,451Updated this week
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.β1,012Updated this week
- Deliver safe & effective language modelsβ545Updated this week
- Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude,β¦β8,834Updated this week
- A language for constraint-guided and efficient LLM programming.β4,073Updated 5 months ago
- The LLM Evaluation Frameworkβ11,787Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,723Updated 5 months ago
- Kickstart your MLOps initiative with a flexible, robust, and productive Python package.β1,361Updated last week
- Interactively explore unstructured datasets from your dataframe.β1,200Updated last week
- dstack is an open-source control plane for running development, training, and inference jobs on GPUsβacross hyperscalers, neoclouds, or oβ¦β1,935Updated this week
- Harness LLMs with Multi-Agent Programmingβ3,730Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embeddingβ2,446Updated last week
- UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured chβ¦β2,327Updated last year
- π¦ Integrating LLMs into structured NLP pipelinesβ1,328Updated 9 months ago
- Superfast AI decision making and intelligent processing of multi-modal data.β2,861Updated 3 weeks ago
- Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.β883Updated 8 months ago
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.β859Updated last year
- A comprehensive guide to building RAG-based LLM applications for production.β1,835Updated last year