Giskard-AI / giskard-ossLinks
π’ Open-Source Evaluation & Testing library for LLM Agents
β5,001Updated 3 weeks ago
Alternatives and similar repositories for giskard-oss
Users that are interested in giskard-oss are comparing it to the libraries listed below
Sorting:
- Evaluation and Tracking for LLM Experiments and AI Agentsβ2,955Updated 2 weeks ago
- Adding guardrails to large language models.β6,103Updated this week
- The LLM Evaluation Frameworkβ12,471Updated this week
- π LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). π Extracts signals from prompts & responses, ensuring saβ¦β965Updated last year
- AdalFlow: The library to build & auto-optimize LLM applications.β3,905Updated last week
- Seamlessly integrate LLMs into scikit-learn.β3,486Updated last week
- LLM Prompt Injection Detectorβ1,386Updated last year
- UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured chβ¦β2,334Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,789Updated 6 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,770Updated last week
- A comprehensive guide to building RAG-based LLM applications for production.β1,842Updated last year
- Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.β885Updated 9 months ago
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.β5,379Updated this week
- A language for constraint-guided and efficient LLM programming.β4,091Updated 6 months ago
- The Security Toolkit for LLM Interactionsβ2,314Updated this week
- Supercharge Your LLM Application Evaluations πβ11,676Updated this week
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,946Updated 2 weeks ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroβ¦β2,975Updated last year
- An awesome & curated list of best LLMOps tools for developersβ5,469Updated last month
- β2,088Updated 2 weeks ago
- AI Observability & Evaluationβ7,882Updated this week
- Interactively explore unstructured datasets from your dataframe.β1,208Updated last week
- A real world full-stack application using LlamaIndexβ2,573Updated 8 months ago
- Automatic Generation of Visualizations and Infographics using Large Language Modelsβ3,180Updated last year
- AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accβ¦β1,470Updated last week
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.β864Updated last year
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,984Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ3,552Updated 6 months ago
- Inspect: A framework for large language model evaluationsβ1,554Updated this week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastrβ¦β1,863Updated last week