Fiddler Auditor is a tool to evaluate language models.
β191Mar 11, 2024Updated 2 years ago
Alternatives and similar repositories for fiddler-auditor
Users that are interested in fiddler-auditor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Research notes and extra resources for all the work at explodinggradients.comβ27Mar 11, 2025Updated last year
- π LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). π Extracts signals from prompts & responses, ensuring saβ¦β990Nov 22, 2024Updated last year
- LLM evaluation.β16Nov 7, 2023Updated 2 years ago
- Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API, simulating the experience of oβ¦β31May 4, 2026Updated last month
- python jupyter notebook tutorialsβ13Apr 14, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A tool for evaluating LLMsβ429Mar 15, 2026Updated 2 months ago
- β20Sep 7, 2019Updated 6 years ago
- An end-to-end benchmark suite of multi-modal DNN applications for system-architecture co-designβ22Dec 13, 2024Updated last year
- Just a bunch of benchmark logs for different LLMsβ127Jul 28, 2024Updated last year
- Jump ReLUβ12Apr 8, 2019Updated 7 years ago
- π Unstructured Data Connectors for Haystack 2.0β17Sep 21, 2023Updated 2 years ago
- A complete guide to evaluate LLMs and RAGs. Both theory and code based approaches covered.β28Nov 16, 2023Updated 2 years ago
- Sample project to get started with dbt-power-user vscode extension using dev-containerβ12Apr 5, 2024Updated 2 years ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β4,019Dec 28, 2025Updated 5 months ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Generating Realistic Synthetic Dataβ45Feb 15, 2024Updated 2 years ago
- Sample notebooks and prompts for LLM evaluationβ160Nov 2, 2025Updated 7 months ago
- Build, evaluate, understand, and fix LLM-based appsβ491Jan 16, 2024Updated 2 years ago
- π’ Open-Source Evaluation & Testing library for LLM Agentsβ5,420Jun 5, 2026Updated last week
- IBM Quantum Challenge Fall 2023β10May 23, 2023Updated 3 years ago
- Scalable Meta-Evaluation of LLMs as Evaluatorsβ43Feb 15, 2024Updated 2 years ago
- AI Observability & Evaluationβ10,091Updated this week
- A library for red-teaming LLM applications with LLMs.β29Oct 11, 2024Updated last year
- Python SDK for running evaluations on LLM generated responsesβ300Jun 6, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Sample solution to automate tedious regulatory compliance processes using multi-agent systemsβ28Apr 15, 2025Updated last year
- This demo shows a multi-turn conversation with an AI agent running inside a Temporal workflow.β34Sep 28, 2025Updated 8 months ago
- LLM Prompt Injection Detectorβ1,499Aug 7, 2024Updated last year
- Adding guardrails to large language models.β6,995Updated this week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.β6,362Updated this week
- This sample shows how to use a Cosmos DB Trigger in Azure Functions Triggers (C# or Python) to automatically generate embeddings on data β¦β21Mar 11, 2025Updated last year
- Building effective agents patterns implemented with Dapr Agentsβ11May 21, 2025Updated last year
- Benchmarks of artificial neural network library for Spark MLlibβ11Dec 3, 2015Updated 10 years ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.β325Jul 10, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Benchmark suite for LLMs from Fireworks.aiβ104Updated this week
- Whispers in the Machine: Confidentiality in Agentic Systemsβ44Apr 20, 2026Updated last month
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β7,590May 2, 2026Updated last month
- the LLM vulnerability scannerβ8,035Updated this week
- New ways of breaking app-integrated LLMsβ2,098Jul 17, 2025Updated 10 months ago
- This repository contains the Julia code for the paper "Competitive Gradient Descent"β25Dec 18, 2019Updated 6 years ago
- β55Aug 22, 2025Updated 9 months ago