fiddler-labs / fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
☆184 Updated last year
Alternatives and similar repositories for fiddler-auditor
Users who are interested in fiddler-auditor are comparing it to the libraries listed below.
- A tool for evaluating LLMs ☆424 Updated last year
- Sample notebooks and prompts for LLM evaluation ☆138 Updated last month
- Python SDK for running evaluations on LLM generated responses ☆291 Updated 2 months ago
- ☆173 Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆113 Updated last week
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆315 Updated 3 weeks ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆130 Updated this week
- ☆166 Updated this week
- 📚 A curated list of papers & technical articles on AI Quality & Safety ☆188 Updated 3 months ago
- ☆185 Updated last year
- Red-Teaming Language Models with DSPy ☆203 Updated 5 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆398 Updated last year
- 🦜💯 Flex those feathers! ☆253 Updated 9 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆245 Updated 10 months ago
- ☆89 Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆82 Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆147 Updated last year
- ☆87 Updated last year
- Framework for building data agent workflows ☆82 Updated 11 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆78 Updated 5 months ago
- ☆71 Updated 9 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act ☆94 Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆325 Updated 8 months ago
- A Lightweight Library for AI Observability ☆250 Updated 5 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆105 Updated last year
- Framework for LLM evaluation, guardrails and security ☆112 Updated 10 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆151 Updated 9 months ago
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated 2 years ago
- Leverage your LangChain trace data for fine tuning ☆42 Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 Updated last year